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(57) Abstract 

The invention consists of (1) cloned Sorangium cellulosum polyketide synthase (PKS) biosynthetic cluster DNA; and (2) the 
nucleotide sequence and predicted protein coding sequences of the cloned DNA. The invention can be used for, but not limited to: 
(a) increasing yields of PKS product in Sorangium cellulosum (e.g., by amplification or genetic modification of the epothilone 
gene cluster or its component parts); (b) increasing yields of polyketide product in a heterologous system by transfer of the 
epothilone gene cluster or its component parts, which may be followed by amplification or genetic modification of the PKS gene 
cluster or its component parts; (c) modification of the polyketide product chemical structure in either Sorangium cellulosum or a 
heterologous host (e.g., by genetic modification of the epothilone gene cluster or its component parts; and (d) for the detection 
of genes and gene products invoh/ed In making potyketides or related molecules in other organisms (e.g., by hybridization or 
complementation assays). DNA sequence and analysis is presented for the following cosmids and plasmids: A2 cosmid; the 
pEPOcosS region (overlapping of pEPOcosS and pEPOcos/); pEPOcosS cosmid; A5 cosmid; Sau4 (10 kb plasmid). 

(57) Abrege 

Uinvention concerne: (1) un ADN biosynth^tique clon6 en grappe de polyketide synthase (PKS) de Sorangium cellulosum; et (2) 
la sequence nucl6otidique et les sequences proteiques codantes pr^vues de I'ADN clon6. L'invention peut avoir les applications 
suivantes (sans caract6re limitatif): (a) augmentation de la production de PKS chez Sorangium cellulosum (p.ex., par 
Tamplification ou la modification ^en^tique de la grappe de genes Epothilone ou de ces parties constitutives); (b) augmentation 
de la production du produit polyketide dans un systEme het^rologue par le transfert de la grappe de g6nes epothilone ou de ces 
parties constitutives, qui peut etre suivie par I'amplification ou la modification genetique de la grappe de genes PKS ou des ses 
parties constitutives; (c) modification de la structure chimique du produit polyketide soit chez Sorangium cellulosum soit chez 
un h6te heterologue (p:ex., par ['amplification ou la modification genetique de la grappe de g^nes Epothilone ou de ces parties 
constitutives); et (d) detection de gEnes et de produits geniques participant d la fabrication de polyketides ou de molecules 
correspondantes dans d'autres organismes (p.ex., par des dosages d hybridation ou d complementation). La sequence d'ADN et 
I'analyse sont presentees pour les- cosmides et les plasmides suivants: cosmide A2; region pEPOcos6 (se chevauchant avec 
pEPOcos6 et pEPOcos7); cosmide pEPOcosS; cosmide AS; Sau4 (plasmide 10 kb). 
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DHA sequences for enzymatic synthesis of polyketide or 
heteropolyketide compounds 

The present invention relates to DNA sequences for enzy- 
matic synthesis of polyketide or heteropolyketide compounds 
produced by the bacterium Sorangium cellulosum. 

Background and introduction 

This patent application describes DNA sequences for the 
enzymatic synthesis of polyketide and/or heteropolyketide 
structures synthesized by the myxobacterium Sorangium cellulo- 
sum,. Several of these compounds have known cytotoxic, immuno- 
suppressive, antibiotic and fungicidal biological activity, 
with the epothilones having been most studied and character- 
ized. The fermentation of large quantities of secondary me- 
tabolites from microorganisms, especially from myxobacteria , is 
a time consuming and difficult process that often involves com- 
plications (e.g. contamination, low product yield, difficult 
isolation and purification) . Therefore it would be advanta- 
geous to use a well -characterized organism for such fermenta- 
tions. After cloning of the desired biosynthetic genes one 
could create such an organism via genetic engineering and ma- 
nipulate the biosynthesis of the compound. Identified sequences 
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can be cloned into optimized expression vectors and generate 
recombinant cell lines that overproduce polyketide structures. 

Polyketide synthases {PKS) and non-ribosomal peptide syn- 
thetases (NRPS) represent macromolecular and multifunctional 
enzymes which are characterized by a modular architecture. PKS 
condenses activated carbonic acids (usually acetate and propi- 
onate) and reduce the resulting 2-keto acid intermediates step- 
wise in a fatty acid biosynthesis -like fashion. Responsible for 
each reaction step is a specific domain that recognizes, acti- 
vates, condenses and reduces the carbonic acid. Depending on 
the presence of these domains in the corresponding modules, 
every reduction stage can occur in the final product (Rawlings, 
Nat. Prod. Reports ii, 523-556 [1997); for a review, see Chem. 
Rev. 12, 2463-2760 [1997]). A typical example for the biosyn- 
thesis of a polyketide is the macrolide antibiotic erythromycin 
(Staunton and Wilkinson, Chem. Rev. 97, 2611-2630 [1997]), 
NRPSs are also modular enzymes and condense via peptide bonds 
amino acids to low molecular weight bioactive sxibstances like 
bacitracin or tyrocidin. Typical domains of these systems acti- 
vate the amino acid and condense it with the growing peptide 
chain. Methylations , epimerisations and modifications via addi- 
tional protein domains are possible (Stachelhaus and Marahiel, 
FEMS Microbiol Lett. 3-14 [1995]). Both types of enzymes 

(NRPS and PKS) share the modular organization of the proteins 
in which specific catalytic domains are responsible for recog- 
nition, activation, condensation and modification of the single 
elongation units. The growing chain of amino acids and/or car- 
bonic acids is extended through the action of one module adding 
one unit. The domains of each module carry the active centers 
responsible for the enzymatic steps of the biosynthesis. 
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Litrle is known about the biosynthesis of biologically ac- 
tive polyketides and polypeptides from myxobacteria . Fragments 
of the biosv-nthetic gene clusters of soraphen and saframycin 
have been described from Sorangium cellulosuw So ce26 and Myxo- 
coccus xanthus, respectively (Schupp et al . , J, Bacteriol . 177 . 
3673-3679 [1995] and Pospiech et al . , Microbiology 141 . 1793- 
1803 [1995]). We have constructed genomic libraries of the 
epothilone producer Sorangium cellulosum So ce90. Gene probes 
based on PKS and PS genes were used to isolate recombinant cos- 
mids, which were then sequenced and characterized. Several 
unique pathways containing PKS, PS, or a combination of both 
types of genes were identified, demonstrating that this organ- 
ism is potentially a rich source of novel bioactive compounds. 

A subject of the present invention is therefore to provide 
DNA sequences according to claim 1 the expression products of 
which perform or are involved in the enzymatic biosynthesis, 
mutasynthesis or partial synthesis of polyketide or hetero- 
polyketide compounds. The DNA sequences may be inserted into 
well known and optimized expression vectors by commmon tech- 
niques of molecular biology, thus allowing transformation, se- 
lection and cloning of cells, which cells are then capable of 
synthezising polyketide or heteropolyketide compounds by fer- 
mentation. Using an overproducing clone allov/s the desired 
polyketide or heteropolyketide compounds be easily produced and 
recovered in high amounts. Further, knowledge of the localiza- 
tion of regulatory DNA segments and individual structural genes 
allows "site -directed mutagenesis" using common techniques for 
genetic engineering, and thus construction of optimized enzymes 
("protein engineering") for fermentative synthesis of polyketi- 
de or heteropolyketide compounds. 
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The invencicn thus further relates to a recombinant ex- 
pression vector according to claim 16, cells transformed there- 
with according to claim 17 and to a process for enzymatic bio- 
synthesis, mutasynthesis or partial synthesis of polyketide or 
heteropolyketide compounds according to claim 23. 

Preferred and/or advantageous embodiments of the present 
invention are subject-matter of the subclaims. 

In brief, the invention consists of (l) cloned Sorangium 
cellulosum polyketide synthase (PKS) and/or peptide synthetase 
(PS) biosynthetic cluster DNA and (2) the nucleotide sequence 
and predicted protein coding sequences of the cloned DNA. The 
invention can be used for, but not limited to, (a) increasing 
yields of PKS product in Sorangium cellulosum (e.g., by ampli- 
fication or genetic modification of the epothilone gene cluster 
or its component parts} , (b) increasing yields of polyketide 
and/or peptide synthetase product in a heterologous system by 
transfer of the corresponding gene cluster or its component 
parts, which may be followed by amplification or genetic modi- 
fication of the PKS and/or PS gene cluster or its component 
parts, (c) modification of the polyketide and/or peptide syn- 
thetase product chemical structure in either Sorangium cellulo- 
sum or a heterologous host (e.g., by genetic modification of 
the corresponding gene cluster or its component parts) and (d) 
for the detection of genes and gene products involved in making 
polyketides or related molecules in other organisms (e.g., by 
hybridization or complementation . assays) . DNA sequence and 
analysis is presented for the following cosmids and plasmids: 
A2 cosmid as defined in claim 6 

the pEP0cos6 region (overlapping of pEP0cos6 and pEPOcosT) 
as defined in claim 7 
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pEPOcosS cosmid as defined in claim 10 

AS cosmid as defined in claim 12 

Sau4 (10 kb plasmid; as defined in claim 14 

The invention is now described in more detail by examples and 
for illustration only. The examples are not to be construed as 
any limitation of the scope. 

Figure 1 is a restriction map of one of the DNA sequences of 
the present invention (cosmid A2 insert) indicating also the 
localization of regulatory DNA segments and the individual 
structural genes {"open reading frames" or ORFs) 1 to 16. 

Figure 2 shows the open reading frames found on pEPOcosS region 

DNA sequence data from A2 cosmid are as defined in claim 6, 

Table 1 correlates ORFs i to 16 found on A2 cosmid with the re- 
spective biological function (Regulators, Enzymes) . 
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Working Examples 

A. Construction of a Sorangium cellulosum cosmid library 

1. Isolation of genomic DNA from S. celluJosum So ce90 

a. Sorangium celluloaum So ce90 was spread onto solid CA-2 
agar and incubated at 3 0=C for 5-7 days. CA-2 agar is prepared 
by autoclaving is g Bacco-agar (Difco Laboratories, Detroit, 
MI) in 800 ml dKaO for 20 min at 121°C and cooling to 50-55°C in 
a water bath. The following filter-sterilized solutions are 
added to the agar: 20% (w/v) glucose, 50 ml; Solution A (7.5% 
[w/v] KN03,7.5% K2HPOO , 10 ml; Solution B (1.5% [w/v] 
MgSOr7H20), 10 ml; Solution C (0.2% [w/v) CaClj' 2H20, 0 . 15% [w/v] 
FeCls) , 10 ml; 1 M HCl , 1 ml; autoclaved 4-day old Sorangium 
cellulosum broth, 100 rr.l . A sample of cells was removed from 
the plates with a sterile loop and inoculated into 50 ml of 
G51c medium in a 250 ml Erlenirieyer flask, G51t consists of 0.5% 
starch (Cerestar) , 0.2% tryptone, 0.1% yeast extract, 0.05% 
CaCl., 0.05% MgSO4-7H30, 1.2% 4 - (2 -hydroxyethyl ) - 1 -piperazine- 
ethanesulfonic acid (HEPSS) , 0.2% glucose, pH 7.6. The flasks 
were shaken at 3C°C, 160 rpm until a dense orange bacterial 
growth was obtained (ca. 5-7 d.). The cells were pelleted by 
centrifugation at 6,000 x g and used immediately or stored fro- 
zen at -20°C. 

The protocol used for isolating chromosomal DNA from bac- 
teria using hexadecyltrimethylammmonium bromide (CTAB) has been 
described previously (Ausubel et al . , Current Protocols in Mo- 
lecular Biology, John V?iiey and Sons, New York, 1990) . The pre- 
cipi-ated DNA was recovered with a bent Pasteur pipette, washed 
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with 70% and 95% ethanol, air-dried, and resuspended in 0.5. ml 
TE buffer {0.01 M Tris-HCl. 0.001 M ethylenediaminotetraacetic 
acid [EDTA] , pH 8.0). 

b. Alternatively, genomic DNA was isolated from S. cellulosum 
cells cultured as described in section A.l using the Midi 
Qiagen Blood & Cell Culture DNA purification Kit (Qiagen, 
Hilden, Germany) following the Qiagen Genomic DNA Handbook pro- 
tocol for bacterial DNA isolation (X9S7, Qiagen, Hilden, Ger- 
many, p. 29 ff.). In order to obtain high molecular weight 
chromosomal DNA the precipitated DNA was recovered with a bent 
pasteur pipette as described in section A.l. 

2. Isolation of plasmid DNA 

a. pFD666! pFD666 is a bifunctional E. coli-Streptomyces cosmid 
cloning vector (see Denis and Brzezinski, Gene 111 . 115-118 
[1992]). To maintain stability of large inserts, it is present 
in low-medium copy number when replicated in E. coll. For this 
reason, isolation of sufficient pure DNA to carry out cloning 
experiments was difficult using commercial kits with standard 
protocols. A modified procedure was therefore used to obtain 
pFD666 DNA. A 10 ml culture of DHIOB (pFD666) was grown for 16- 
20 hr at 37*^C in LB (1% tryptone, 0.5% yeast extract, 0.5% 
NaCl, pH 7.0) medium containing 50 ^9/^1 kanamycin sulfate. 
Fifty ml of LB + kanamycin was inoculated to a starting ODgoo of 
ca. 0.25 and shaken at 300 rpm, 37°C, until the ODeoc reached 
ca, 0.6. Five hundred ml of LB + kanamycin medium in a 2 1 
flask was inoculated with 25 ml of this culture and incubated 
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under the sane conditions for 2.5 hr. Chloramphenicol i 2 . 5 ml 
of a 34 mg/mi solution in 100% EtOH) was added and the incuba- 
tion continued for an additional 16-20 hr . (The previous steps 
were performed according to Maniatis et al . Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY, 1989.) Cells were pelleted for 10 min, 16,000 x g . 
They were resuspended in 9 ml of 50 mM glucose/25 mM Tris-HCl 
(pH 8.0) /lO mM EDTA and transferred to a 50 ml disposable cen- 
trifuge tube. One ml of a freshly-prepared 10 mg/ml lysozyme 
solution in 10 mM Tris-HCl, pH 8 , 0 was added and the cell sus- 
pension incubated in a 37*C water bath for 10 min. Twenty ml of 
a freshly-prepared 0.2 NaOH/1% sodium dodecyl sulfate (SDS) so- 
lution was added and the tube inverted gently 5-7 times to mix 
the contents. After 5 min at room temperature, 15 ml of 5 M po- 
tassium actate (pH 4.8) was added and the tube inverted sharply 
3-4 times. The tube was centrifuged at 6,000 x g for 10 min at 
4°C and the supernatant poured though 2 layers of sterile 
cheese cloth into a fresh 50 ml disposable tube. Isopropanol to 
a final concentration of 0.6% was added and the contents of the 
tube mixed several times. The precipitated nucleic acid was 
centrifuged at 6,000 x g for 10 min at 4*'C. The pellet was 
washed with 70% EtOH and any excess EtOH was aspirated from the 
pellet, which was allowed to air dry for 5 min. It was resus- 
pended in 5 ml of 50 mM 3- (N-Morpholino) propanesulf onic acid 
(MOPS)/750 mM NaCl, pH 7.0 and added to an equilibrated to 
QIAfilter Midi column (Qiagen, Chatsworth, CA) . The manufac- 
turer's protocol for washing and eluting the plasmid DNA was 
followed. 
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b. SuperCos: SuperCos clasmid DMA was purchased from Strata- 
gene (La Jolla, CA) . 

3. Preparation of ca» 3 8-47 kb Sau3Al fragments of S, cellulo- 
sum chroinosoinal DNA 

a. S. cellulosum chromosomal DNA prepared as described in sec- 
tion A.l.a was partially cleaved with restriction endonuclease 
Sau3Al in a 1000 ^il reaction volume consisting of 50 pig chromo- 
somal DNA, 5 units enzyme (Promega, Madison , WI), 0.006 M 
Tris-HCl, 0.006 M MgClj, 3.10 M NaCl , and 0.001 M dithiothrei- 
tol (pH 7,5) for 5 min at 37°C. The reaction mixture was ex- 
tracted once with an equal volume of 1:1 phenol : chloroform. 
After cent rifugat ion, che upper aqueous phase was saved, to 
which 0.1 vol. of 3 M scdium acetate and 0.6 vol. isopropanol 
was added. DNA was pelleted by centrifugation for 5 min at 
16,000 X g in a microfuge and washed once with 0.5 ml 70% EtOH. 
After drying in a SpeedVac (Savant Instruments, Farmingdale. 
NY) for 5 min, the peliez was resuspended in 0.1 ml TE buffer. 
The DNA was layered onccp of a 12 ml 10-40% sucrose gradient 
prepared in TH buffer and ceiitrifuged at 113,600 x g for 16 hr, 
10*^C using a Beckman SV/40Ti rotor (Beclcman Instruments, Palo 
Alto, CA) . Five hundred ^tl aliquots of the gradient were re- 
moved using a pipetor beginning at the top of the tube. Samples 
(5 /il) of the fractions were analyzed by electrophoresis 
through a 0.5% agarose gel in TAE buffer (0.04 M Trizma base, 
0.02 M acetic acid, and 0.001 M EDTA, pH 8.3) containing 0.5 
;:g/ml ethidium bromide for 6 hr at 100 V. Fractions containing 
DNA fragments of ca. 40-45 kh were identified by comparison to 
3 i^igh„. molecular weichz DNA standard (Life Technologies, 
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Gaithersburg, MD) . Sucrose was diluted from the corresponding 
0.5 ml fraction by addition of 0.5 vol. TE. Subsequently, DNA 
was precipitated by addition of 0.1 vol. 3 M sodium acetate and 
0.6 vol. isopropanol. DNA was pelleted by centri f ugation at 
16,000 X g for 10 min in a microfuge. DNA was washed with 0.5 
ml 70% EtOH and dried in a SpeedVac with moderate heat for 10 
min. Finally, the DNA was resuspended in distilled H2O at a 
concentration of 0 . 5 mg/ml. 

b. Alternatively, 10 fig of S. cellulosum chromosomal DNA pre- 
pared as described in A.l.b was treated with 0.3 U SauBAl {New 
England Biolabs, Beverly, MA) for 1 h at 37*^0 in 400 ^1 of the 
supplier's recommended reaction buffer. Formation of DNA frag- 
ments of about 40 kb in size was checked by comparison of the 
motility behavior with high molecular weight DNA standards af- 
ter a 0.3% agarose gel electrophoresis. An equal volume of phe- 
nol : chloroform (1:1) was added, mixed and centrifuged. The up- 
per aqueous phase was recovered and 0.1 vol. of 3 M sodium ace- 
tate and 0.6 vol. of isopropanol were added. After centri f uga- 
tion, the precipitated DNA was washed twice with 0 . 5 ml 70% ice 
cold ethanol and finally air-dried. The DNA fragments were re- 
suspended in 100 shrimp alkaline phosphatase reaction buffer 
and dephosphorylated for 150 min. at 37°C using 2 U shrimp al- 
kaline phosphatase <Amersham Life Science, Cleveland, OH) . A 
phenol : chloroform extraction followed as described above. Fi- 
nally, the DNA was precipitated by addition of O.l vol. 3 M so- 
diur acetate and 0.6 vol. isopropanol, dried, and dissolved in 
TE buffer. 
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4. Preparation of cosmid libraries 

a. Using pFD666: Vector pFD666 was cleaved with restriction 
endonuclease BamHl in a 0.02 ml reaction volume consisting of 2 
fig plasmid DNA, 10 units of BamHI CPromega) , 0.006 M Tris-HCl, 
0-006 M MgCls, 0.05 M NaCl , and 0.001 M di thiothreitol (pH 7.5) 
for 90 min at 37°c. Five. ^1 of lOx alkaline phosphatase buffer 
(0.5 M Tris-HCl [pH 9.3], 0.01 M MgCla, 0.001 M ZnClj. 0.01 M 
spermidine) was added co the reaction followed by alkaline 
phosphatase (0.01 units/pmol ends; Promega) and distilled H2O 
to a final volume of 0.05 ml. The sample was incubated for 30 
min at 3 7°C and a second aliquot of phosphatase was added. Af- 
ter a further 30 min at 37°C, .0.3 ml of stop buffer (0.01 M 
Tris-HCl CpH 7.5], 0.001 M EDTA, 0.2 M NaCl, 0.5% SDS) and 0.35 
ml of 1:1 phenol; CHCI3 was added to the reaction. The sample 
was mixed gently several times by inversion and centrifuged at 
16,000 X g for 3 min to separate the phases. The aqueous layer 
was removed co a new microfuge tube. 0.1 vol. 3 M sodium ace- 
tate and 2 vol. 100% EtOH were added and the precipitated DNA 
pelleted by centrif ugation at 16,000 x g for 10 min. Liquid was 
removed by aspiration and the pellet washed once with 0.5 ml 
70% EtOH. The DNA was dried in a SpeedVac and resuspended in TE 
buffer to 0.5 mg/ral . 

Digested, phosphatase- treated pFD666 was ligaced to the 
partially-cleaved chromosomal DNA (see sections A. 3. a and 
B.l.a) in a 0.005 ml reaction consisting of 1 ^g pFD666, 1 fig 
S. ceiiuiosu/n DNA, 0.03 M Tris-HCl (pH 7.8), 0.01 M MgCla, 0.01 
M di thiothreitol, and 0.0005 M adenosine- 5 ' -triphosphate and 
1.5 Weiss units of T4 DNA ligase (Promega). The reaction was 
carried out at room temperature for 2 hr. The entire reaction 
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mix was packaged inco bacteriophage X in vitro using Fackagene 
extracts (Promega) according to the manufacturer's directions. 
The entire packaging reaction (0.5 ml) was diluted with 4 . 5 ml 
SM buffer (per liter: 5.8 g NaCl , 2 g MgSO4.7H20, 1 M Tris- 
HCl[pH 7.5], 5 ml 2% gelatin solution). Transfection was per- 
formed by adding 10 ml cf an overnight culture of E, coli DH5a 
that had been grown in LB medium with 0.01 M MgS04 and 0.2% 
maltose to the diluted phage and incubating at 37*'C for 20 min. 
0.8 ml of LB was added and the cells shaken at 22 5 rpm for 1 hr 
at 2l''C. Cells were pelleted, resuspended in LB, and spread 
onto a 150 mm LB + kanamycin agar plate. After 3d. at 30°C, 
the colonies were harvested by picking ca. 800 colonies into 
2.0 ml LB + kanamycin medium containing 20% glycerol , freezing 
on dry ice, and storing at -70°C. In addition, six kanamycin- 
resistant colonies were inoculated into 2 ml LB + kanamycin 
liquid medium and incubated at 37**C, 250 rpm, for 18-24 hr. 
Cosmid DNA was prepared using a standard alkaline lysis proce- 
dure starting with 1.5 rr.l of the culture, DMA was digested with 
restriction endonuclease ?stl and samples electrophoresed on a 
0.8% TAE agarose gel for 1.5 hr at 100 V. A unique restriction 
pattern was noted in each sample and the total size of the in- 
sert was calculated to be between 40 and 45 kilobases. 

b. Using SuperCos: 30 fig of vector SuperCcs was digested with 
Xbal (New England Biolabs, Beverly, MA) for 210 min at 37°C in 
100 /il of the recommenced reaction buffer. Ten /il sodium ace- 
tate and 60 /il isopropar.ol was added before the solution was 
centrifuged for 30 min at 16,000 x g. The precipitated DNA was 
washed twice with 500 fxl ice cold 70% ethanol . The vector DNA 
was precipitated and air-dried, dissolved in 135 pi . shrimp al- 
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kaline phosphatase reaction buffer and created wizh 2.5 U 
shrimp alkaline phosphatase for 150 min. After heat inactiva- 
tion of the enzyme at 75^C for 20 min, a phenol : chloroform ex- 
traction was performed as described in section 1. c. The DNA. 
resuspended in 100 /il BamHI restriction buffer was hydrolyzed 
with 15 U BamHI (New England Biclabs, Beverly, MA) for 180 min. 
A phenol : chloroform extraction followed (see section A. 3). The 
SuperCos DNA was precipitated by additon of Q.l vol 3 M sodium 
acetate and 0.6 vol isopropanol, centrifuged at 16,000 x g, and 
resuspended in 50 /zl TE buffer. 

Four ^g of digested vector DNA was ligated with 10 par- 
tially hydrolyzed genomic DNA from S. ceJIuiosum (as described 
in section A.3.b) in a final volume of 20 /xl using 2 U T4 DNA 
ligase and the appropriate reaction buffer (Gibco BRL, Eggen- 
stein, Germany) . The reaction was carried out at 16°C over- 
night. The reaction mixture was packaged into phage particles 
using the Gigapack III XL packaging extract kit (Stratagene) 
according to the manufacture's protocol. Treatment of packaging 
reaction mixture and transfection of E. coll SURE (Stratagene) 
was performed as described in 4. a. Transfected cells were con- 
centrated by centrifugation, resuspended in fresh LB medium and 
distributed on LB agar plates containing 50 /ig/ml"^kanamycin. 
The plates were incubated overnight at 20^C, 1600 recombinant 
clones were transferred into 96 well microtiter plates filled 
with 80 ^1 LB medium conzaining 50 Mg/iTil'kanamycin per well and 
propagated overnight at '30«C. The following day zhe microtiter 
plates were used to inoculate a second set of microtiter plates 
in order to obtain a duplicate of the recombinant clones. Each 
well of the original sez of microtiter plates was supplemented 
with 80 Ml 50 % glycerol and the entire plate stored at -70*»C. 
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20 randomly chosen transf ormants were inoculated into 3 ml 
LB medium with 50 ;ig/ml"^ kanamycin and incubated over night at 
37«C in order to isolate plasmid DNA using the Qiagen plasmid 
extraction kit (Qiagen, Hilden, Germany) . Restriction fragment 
analysis of the recombinant cosmids using the restriction endo- 
nucleases PstI and Bglll indicated that the cosmids contained 
inser-s of approximately 35 to 42 kb in size. 

B. Construction of a S. cellulosum plasmid library 

1. Preparation of 8-12 kb fragments of 5. cellulosum chromoso- 
mal DNA. 

S. celiuJosum chromosomal DNA prepared as described in sec- 
tion A.l.a was partially cleaved with restriction endonuclease 
Sau3Al in a 100 fiL reaction volume consisting of 5 ^g chromoso- 
mal DMA, 5 units enzyme (Promega, Madison , WI) , 0.006 M Tris- 
HCl, 0.006 M MgClz, 0.10 M NaCl , and 0.001 M dithiothreitol (pH 
7.5) for 4 min ac 37°C. The digested DNA was elect rophoresed 
through a 11 x 14 cm 0.8% TAE-agarose gel for 18 hr at 17 V. 
Fragments of 8-12 kb were cut. from the gel and purified using 
the QIAquick Gel Extraction Kit using the manufacturer's proto- 
col (Qiagen) . 

2. Preparation of the plasmid library 

Plasmid pZero2 . 1 (Invitrogen, Carlsbad, CA) was cleaved with 
resirictidn endonuclease BamHI in a 0.02 ml reaction volume 
consisting of 1 ^g plasmid DNA, 10 units of BamHI (Promega) , 
0.036 M-Tris-HCl, 0.006 M MgCla, 0.05 M NaCl, and 0.001 M di- 
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thiothreitol { pH 7.5) for 20 min at 37°C. 0.08 ml of dHjO and 
0.1 ml of 1:1 phenol :CHC13 was added. The sample was briefly 
vortexed and centrifuged at 16,000 x g for 2 min. The aqueous 
layer was removed to a new microfuge tube. 0.1 vol. 3 M sodium 
acecace and 2 vol. 100% EtOH were added and the precipitated 
DNA pelleted by centrifugation at 16,000 x g for 10 min. Liquid 
was removed by aspiration and. the pellet washed once with 0.5 
ml 70% EtOH. The DNA was dried in a SpeedVac and resuspended in 
TE buffer to 0.004 {xg/nl . Digested pZero2 . 1 was ligated to the 
partially-cleaved chromosomal DNA in a 0.01 ml reaction con- 
sisting of 0.004 lag pZero2 . 1 , 0.05 ^g S. ceiluiosu/n DNA, 0.03 M 
Tris-HCl (pH 7.8), 0.01 M MgClj, 0.01 M dithiothreitol , and 
0.0005 M adenosine-5' -triphosphate and 1.5 Weiss units of T4 
DNA ligase (Promega) . The reaction was carried out at room tem- 
perature for 2 hr. 0.015 ml dHjO and 0.25 ml of 1-butanol were 
added, the sample vortexed briefly, and centrifuged at 16,000 x 
g for 10 min. Liquid was aspirated away from the pellet and the 
sample dried in a SpeedVac for 5 min. The ligated DNA was re- 
suspended in 0.005 ml dHjO and mixed with 0.04 ml of electro- 
competent Escherichia coll DHIOB cells (GIBCO/BRL, Gaithers- 
burg, MD) . The sample was placed into a pre-chilled 0.2 mm-gap 
electroporation cuvette and transformed into the bacteria by 
electroporation using a BioRad Gene Pulser II unit (BioRad, 
Hercules, CA) at 25 ^iF and 200 0.96 ml SOC medium (0.5% 

yeast extract, 2% tryptone, 10 mM NaCl, 2.5 mM KCl , 10 mM 
MgCl2, 20 mM MgSO^ , 20 mM glucose) was mixed with the cells and 
transferred to a 1.5 ml microfuge tube. The sample was incu- 
bated at 37^C, 225 rpm, for 1 hr . Aliquors of the cells were 
spread onto an LB agar -r kanamycin and incubated at 3 7®C for 20 
hr to estimate the number of transf ormancs obtained. Six kana- 
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mycin resistant colonies v;ere confirmed to contain an insert of 
the expected size as described in section A. 4. a. 

C. Identification of cosmids possessing polyketide synthase 
genes 

1. Colony blot hybridizations using cosmid library in pFD666 : 

A 20 X 20 cn sheer of Duralon UV membrane (Stratagene) 
was placed on top of a 24 . 5 x 24 . 5 cm square bioassay dish con- 
taining 250 ml LB agar - kanamycin. An aliquot of the frozen 
cosmid library in 1 ml IH medium was spread on the filter. The 
plate was incubated at 37-C for 24 hr. Colonies were replicated 
onto two fresh filters which were placed onto LB + kanamycin 
agar medium and incubated at 28°C for 18 hr. Lysis of cells and 
neutralization of released DNA was performed according to di- 
rections that were provided with the filters. The DNA was 
crosslinked to the filters using a UV Stratalinker 2400 unit 
(Stratagene) in the autc crosslink mode. Cell debris was re- 
moved by placing the fil-ers in a container with a solution of 
3 X SSC (20 X SSC contains, per liter, 173.5 g NaCl , 88.2 g so- 
dium citrate, pH adjusted to 7,0 with 10 N NaOK) , 0.1% SDS and 
rubbing the lysed colonies with a Kimwipe. The filters were 
then incubated at least 3 hr with the same wash solution for at 
least 3 hr at 65*^C. The plasmid library was treated similarly 
except cells were spread onto a 137 mm circular Duralon UV mem- 
brane placed on top of a 150 mm petri dish containing 80 ml LB 
agar kanamycin. 

For hybridizations, a probe consisting of a 650-base "pair 
(bp) polymerase chain f?C?.) fragment representing a portion of 
a S. cellulosum polyketide synthase gene was used. The fragment 
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was amplified using primers to consensus regions of Type. I 
(macrolide) polyJcetide synthase (PKS) genes (Swan et al . , Mol . 
Gen. Genetics 242, 358-362 [1994]). A series of sense and anti- 
sense oligonucleotides were prepared for PGR studies as indi- 
5 Gated in the following table 2: 

15 

Table 2 





Oligo- 


I. DMA seauen-e (5'-> 3') 


Corresponding 
amino acid 
sequence 


20 


nucleotide 






120 


CGGT ( C/G ) AAGTC ( C/G ) AACATCGG 


KSNIGHT 


25 


(sense) 








121 (anti- 


GC (A/G) ATCTC ( A/G) CCCTGCGA (A/G) TG 


HSQGEIA 




sense) 








122 


GT (C/G) GACAC (C/G) GC (C/G) TGCTC (C/G) 


VDTACSS 


30 


(sense) 








123 


GG (C/G) AC (C/3) AACGC (C/G) CACGT (C/G) A 


GTNAHVI 




(sense) 


T 




35 


124 (anti- 


CCCTG ( C/G) CC :c/G) GGGAA (C/G) ACGAA 


FVFPGQG 


sense) 







10 

40 The selection of C or G where necessary in the third position 

of a codon reflects the very high overall G + C content of S. 
- cellulosum (ca. 70%). Conditions for PGR were as follows: 0.01 
M Tris-HCl (pH 9.0), 0.C5 M KCl, 0.003 M MgClj, 0.1% Triton X- 

45 

15 IOC,. 200 of each primer, 2.5 U Taq DNA polymerase (Promega) , 
5.0% dimethyl sulfoxide : Sigma) . and 0.01 /ig of S. celiuiosu/n 
chromosgmal DNA in a O.ZS ml reaction volume. Reactions were 
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carried out in a Perkin-Elmer Model 480 Thermocycler (Perkin- 
Elmer Corporation, Foster City, CA) under the following condi- 
tions: 94°C, 1 min; SO^^C, 1 min, 72°C. 1.5 min for a total of 30 
cycles. Each possible combination of sense and ant i -sense prim- 
ers were tried. A 650-bp and 350-bp fragment was amplified us- 
ing cligos 120 + 124 and 123 +124, respectively. The sequence 
of Che fragments were determined using the ALFexpress AutoRead 
kit to fluorescently label the DNA, which was analyzed on an 
ALFexpress sequencing apparatus (Pharmacia) . The data indicated 
both PCR fragments possessed significant homology to polyketide 
synthase genes of Type I antibiotics. The 650-bp fragment was 
chosen for hybridization experiments. 

The fragment was labeled with "p-dCTP using the NEBlot kit 
(New England Biolabs, Beverly, MA) and purified on a Bio-Spin 6 
column (BioRad, Hercules, OA.). Duplicate blots were pre- 
hybridized in 3 X SSC (1 X SSC contains 0.15 M sodium chloride 
and 0.015 M sodium citrate, pH 7.0), 4 X Denhardt's solution 
(100 X is 2% Ficoll [Type' 4 00] , 2% polyvinylpyrrolidone, and 2% 
bovine serum albumin (Fraction V]), and 100 tig/ml sheared, de- 
natured salmon sperm DNA; all reagents purchased from Sigma 
Chemicals, St. Louis. The labeled DNA was heated in a boiling 
water bath for 5 min to denature the strands, cooled on ice, 
and added to the pre-hybridization solution. The filters were 
incubated for at least 18 hr in a roller bottle hybridization 
oven. They were transferred to new bottle, then washed two 
times in 2 X SSC, 0.1% SDS at 70^C for 30 min (moderate strin- 
gency} . The membranes were placed on Whatman 3MM paper to re- 
move excess liquid, covered with Saran Wrap, and exposed to 
auTioradiography film (Kodak X-OMAT L3) with two intensifying 
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screens. The cassette v/as placed at -70°C and developed at ap- 
propriate intervals. 

Approximately 100 colonies were seen to have hybridized on 
Che duplicate filters. Fourteen of these were isolated from the 
master plate and grown in 4 ml LB + kanamycin medium for 20-24 
hr, 3 7°C, 2 50 rpm. Plasmid DNA was prepared using the standard 
alkaline lysis method and digested with restriction endonucle- 
ase Pstl. The digested DNA was electrophorissed on a 0.8% aga- 
rose gei in TAE for 3 hr at 100 V. Fragments were transferred 
to Duralon UV using the VacuGene XL vacuum blotting unit (Phar- 
macia) and the recommenced alJcaiine denaturation protocol. Hy- 
bridization with radioactively- labeled PGR fragment and washing 
were carried out as described above. Two prominent types of 
cosmids were observed; one contained Pstl fragments of ca. 7.0, 
5.0, and 1.1 kb (pEPOcoso and pEP0cos7) that hybridized to the 
probe; the other type had fragments of ca. 6.0 and 3,6 kb 
(pEPOcosa and pEP0cosl3; which were homologous to the probe. 
Restriction analysis confirmed that cosmids showing identical 
hybridization patterns had identical or overlapping inserts. 
PGR reactions using primers representing consensus sequences of 
Type I PKS genes were performed using the isolated cosmid DNA 
as template under conditions described above, except ca. 0.01 
^g of cosmid DNA was included as template. Cosmids pEP0cos6 and 
pEPOcosB amplified the 650 -bp fragment seen when oligonucleo- 
tides 120 + 124 were used, while pEPOcosS and pEP0cosl3 sup- 
ported amplification cf an 1100-bp PGR fragment with oligos 122 
and 124. The latter fragment was sequenced and confirmed to 
possess strong similarity to Type I PKS genes. These data con- 
firm that the recombinant cosmids are related to each other and 
that all contain PKS -like genes. 
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2. Colony blot hybridizations of plasmid library in pZero2.1: 

A 137- mm circle of Duralon UV membrane was placed on top 
of a 150-mm containing 75 ml LB agar + kanamycin. An aliquot of 
the plasmid library (representing ca. 2,000 recombinant colo- 
nies) in 0.5 ml LB mediuni was spread on the filter. The plate 
was incubated at 37°c fcr 20 hr. Colonies were replicated onto 
two fresh filters which were placed onto LB + kanamycin agar 
medium and incubated at 37°C for 6 hr. The filters were proc- 
essed for hybridization as described in Section C.l. Out of 8 
positive colonies detected, one contained a plasmid with a DNA 
region noz encoded by ei-her pEP0cos6 or pEPOcosS . This plas- 
mid, called Sau4, was characterized in more detail. 

3. Colony blot hybridizations of cosmid library in SuperCos: 

The recombinant E. coli clones from the microtiter plates 
(see section 4. b) were used to produce two identical sets of 
hybridization filters in order to identify cosmids carrying PKS 
and PS genes. The recorrJrinant clones were spotted onto 2 sets 
of 22 X 22 cm LB agar plates ' containing 50 /zg/ml kanamycin. 
Each plate contained 384 clones therefore representing 4 micro- 
titer plates. The clones were incubated at 30*0 overnight. Af- 
ter pre-cooling for approximately 3 h at 4°C, 20 x 20 cm Hybond 
N" Nylon membranes (Arr.ersham, Braunschweig, Germany) were 
placed onto the agar surfaces. After 2 min. the membranes were 
removed and placed for 15 min. on Whatman 3 MM paper (Whatman 
paper Ltd., Maidstone, England) soaked with denaturation solu- 
tion (0.5 N NaOH, 1,5 K XaCl ) before they were transfered onto 
Whacman 3 MM paper saturated with neutralization solution {1 M 
Tris-HCl, pH 7.5, 1.5 K .MaCl) . Subsequently the membranes were 
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placed onto Whatman 3 MM paper soaked with 2 X SSC (0.3 M NaCl, 
0.03 M sodium citrate, pH 7.2) for 10 min. The membranes were 
baked for 4 0 min at 85=C. Then, each membrane was overlayed 
with 5 ml Proteinase K solution {2 mg/ml Proteinase K in 2 x 
SSC) and incubated at 37 **C for 90 min. Finally, cell debris 
was removed by wiping the membranes with a Kimwipe pre-wetted 
with 2 X SSC. 

AS we were seeking in particular to identify biosynthetic 
pathways containing both PKS and PS genes, the following hy- 
bridization strategy was taken: The screening was initially fo- 
cused on ketosynthase domains from type I PKSs and on the ade- 
nylation domain from PSs. Target-specific primers were used co 
amplify DNA fragments of the corresponding genes from chromoso- 
mal DNA of S, ceiiulosum by PCR. The fragments obtained were 
then cloned, sequenced and the deduced amino acid sequence com- 
pared to known ketosynthase and adenylation domains of PKS and 
PS, respectively. In a second step these PCR fragments were 
used as gene probes to detect recombinant cosmids of the S. 
cellulosum cosmid library. 

Oligonucleotides based on conserved amino acid sequences 
of ketosynthase domains from various- type I PKS were optimized 
for myxobacterial DNA by comparison to a known myxobacterial 
biosynthetic gene cluster (Schupp et al . , J. Bacterid. 177, 
3673-3679 [1995]) resulting in primer 
KSlUp (5'- 

C/A;GIGA(A/G)GCI (A/C/T) (A/T)I(C/G) (C/A)IATGGA(C/T)CCICA(A/G)CAI 
(A/C)G-3') and 

KSCl (S'-GG(A/G)TCICCIA(A/G) I (G/C) (T/A) IGTICCIGTICC (A/G) TG- 3 ' ) . 
PCR-primers TGD (5'- 

TCA/T) (C/T)CGIACIGGIGA(C/T) (C/T) (G/T) IG (G/T) ICG- 3 ' ) and 
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LGG (5'- 

A(A/T)IGA(A/G) (G/T) (G/C) ICCTCCI (A/G) (A/G) (G/C) I (A/C) (A/G)AA(A/G 
)AA-3') 

directed to genes encoding adenylation modules have been de- 
scribed by Turgay et al . (Pepc. Res. 7, 238-241 [1994]). ?CR 
reaction mixtures with a final volume of 25 ^1 contained 0 . 1 /^g 
template DNA, 0.2 U Taq DNA- polymerase (Gibco BRL, Eggenstein, 
Germany), 5 ^mol dNTP, 5% dimethyl sulfoxide (Sigma), 1.5 mM 
MgClj, 25 pmol ot each primer and the appropriate reaction 
buffer supplied by Gibco BRL. Chromosomal DNA of S. ceiiuiosum 
was used as template. Additionally, chromosomal DNA of Myxococ- 
cus fulvus was used with PS primers. Reactions were carried out 
in an Eppendorf Mastercyler Gradient (Eppendorf, Germany) using 
the following conditions: denaturacion 30 s at 97°C, annealing 
30 s at 55°C, extension 60 s at 72*^C for a total of 30 cycles. 
The formation of ca . 700 bp fragments using the KS primers and 
of ca. 350 bp fragments with the PS primers were confirmed by 
0.8% agarose gel electrophoresis. Fragments of independent PGR 
reactions were ligated into vector pCR2.1T0P0 using the TOPO TA 
Cloning Jcit (Invitrogen, Leek, The Netherlands) according to 
the manufacturer's protocol and transformed into E. coli XLl- 
Blue. Sequencing of the resulting plasmids and analysis of uhe 
deduced amino acid sequence revealed three different KS frag- 
ments, designated pM008.4, pM008.6, pM008.7, one PS fragment 
(pAPsl) corresponding to S. cellulqsum and one PS fragment 
(pDPsl) obtained with chromosomal DNA of M. fuivus. The PGR 
fragments were re- isolated by digestion wizh EcoRl from the 
plasmids pM008 . 4 , pM008.6, and pM008.7, labeled, pooled and 
used as gene probes in hybridiaation experiments as described 
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below. The same procedure was performed with the PS fragments 
of pAPsl and pDFsl . 

Hybridization with PKS and PS specific DNA probes (see 
above) was carried out using the DIG nonradioactive labeling 
and detection kit {Boehringer Mannheim, Germany) and performed 
according to the supplier's manual using buffer containing 50% 
formamide. The membranes were hybridized in plastic bags con- 
taining approx. IC ml of hybridization solution at 3 9°C over- 
night. Unspecific binding of probes was removed by 2 wash steps 
with 2 X SSC, 0.1% SDS at room temperature for 20 min. and one 
stringent wash step with 0.5 x SSC, 0.1% SDS at 60°C for 20 
min. Detection of hybridizing DNA fragments was performed with 
the above mentioned system according to the manufacturer's pro- 
tocol using CSPD as chemi luminescent substrate. The signals 
were recorded by exposure of the treated membrane to Hyperfilm 
ECL {Amersham Life Science, Little Chalfont, England) which was 
developed in appropriate time intervals. 

71 signals were detected with the PKS specific gene probe. 
On the duplicate filters 35 signals were obtained with the PS 
specific gene probe of which 7 were already known from the PKS 
hybridization experiment. These recombinant cosmids harbored 
PKS- and PS-encoding genes. In order to corroborate these re- 
sults PGR experiments were performed with DNA of the 7 recombi- 
nant cosmids as template and PKS (KSlUp, KSDl) and PS specific 
primers (TGD, LGG) generating fragments of the expected size df 
approx. 700 bp and 3 50 bp, respectively (primers and reaction 
conditions see above) . 

A comparison of the restriction fragment patterns of the 
DNA from the 7 recombinant cosmids carrying PKS and PS genes 
digested by BamHI facilitated an arrangement of the cosmids in 
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3 groups. They were represented by cosmids designated A2 and 
A5. The remaining croup was represented by pEPOcosS . Therefore, 
A2 and A5 represented gccd candidates for further DNA sequence 
analysis because they carry both PKS and PS genes. 

D. Random "shotgun" sequencing of recombinant cosmids and plaa- 
mids 

1, Library construction 

a. pEPOcosS, pEPOcosS, A5, and Sau4 : pEPOcos6 and pEPOccs? 
were sequenced to comple-ion, and contiguous sequence data and 
analysis for these overlapping cosmids is presented below for 
the "cos6 region" {cf. claims 7 and 9). Sequencing of cosmid 
A5, pEPOcosS and plasmid Sau4 was taken to the point of large 
contiguous sequences (ccntigs) representing the S. cellulosum 
insert; sequence and analysis presented below {cf. claims 10 to 
15) . 

Randomly sheared libraries were constructed for cosmids 
and plasmids of interest using a protocol similar to that of of 
Fleischmann et al . , 1955 (Science 269 , 496) and modified in 
Eraser et al . , 1995 {Science 370, 397). Briefly, Qiagen-column 
purified cosmid DNA (-1C ^g) was sheared to a size of approxi- 
mately 2 kb and the DNA end- repaired using BAL31 nuclease. The 
DNA was gel -purified after electrophoresis through a 0.75% low- 
melting temperature agarose gel containing 0.5 /io/ml ethidium 
bromide in IX TAE buffer run at 80 V for 2 hours. The volume of 
the low-meXt agarose gel slice was estimated by adding the gel 
slice to a microfuge tube and weighing, then 0.1 vol. of 3 M 
sodium acetate (pH 7) v.*a3 added and the agarose incubated at 
60°C. The temperature -.vas equilibrated to 37°C, and DNA ex- 
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tracted twice using an equal volume of buffered phenol (Life 
Technologies) . The aqueous phase was transferred and extracted 
once with an equal volume of chloroform, then ethanol preci- 
pated by the addition of 2 vol. cold 100% ethanol. DNA was con- 
centrated by spinning at 16,000 x g in a microcentrifuge. The 
DNA pellet was washed with 1 ml 70% ethanol and resuspended in 
100 Ml of O.IX TE. The DNA was ligated to Smal -digested, phos- 
phatase- treated pUCia vector (Pharmacia) , and single insert re- 
combinants isolated by gel-purification of the band containing 
vector plus a single insert, followed by T4 polymerase polish- 
ing, and a final intramolecular ligation of the vector-plus- 
single - insert DNA. This final ligation represents a library of 
highly random ca. 2 kb fragments that was used for shotgun se- 
quencing of the ca. 40 kb cosmids or ca. lO kb plasmids. 

b. Cosmid A2 : Cosmid DNA with inserts of S. cellulosum was 
isolated by an alkaline lysis procedure and purified with Ma- 
cherey Nagel columns (Machery und Nagel GmbH und CoKG, Duren, 
Germany) using manufacturer's recommendation. Purified Cosmid 
DNA was sonicated, end- repaired using T4 DNA Polymerase (Boe- 
hringer Mannheim, Germany) . After gel -purification fragments of 
a size of approximately 2 kb were ligated into Smal -digested, 
phcsphatase-treated pTZlSR vector (Pharmacia) . The ligation 
represents a library of highly random ca . 2 kb fragments that 
was used for shotgun sequencing of the ca . 4 0 kb cosmid. 

2. Sequencing and assembly 

a. pEP0cos6, pEPOcosS, Sau4, and A5 : DNA (1 ;zl of 100 

total in the library) was transformed into K. coli by electro- 
poration, (20 /xl of Electromax DHIOB cells from Life Technolo- 
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gies) and cells spread onto LB plates containing 50 Mg/ml ampi- 
cillin. After growth overnight at 37 °C, transf ormants (ca. 300- 
3000 CFU total) were tranfered to 96-well growth blocks and 
shaken overnight at 37°C in 1.3 ml LB medium with 50 /zg/ml am- 
picillin. Templates were prepared from these cells by an alka- 
line lysis procedure (Qiagen QiaQuick Turbo Prep) to yield pu- 
rified, double -stranded plasmid DNA. Cycle -sequencing of the 
plasmid templates was performed using universal forward and re- 
verse primers and BigDye Terminator sequencing kits (Applied 
Biosystems), using the manufacturer's recommendations, then re- 
solved using an ABI377 automated sequencer. Sequences were ed- 
ited using Phred, then assembled into larger contiguous se- 
quences using Phrap (Phil Green, University of Washington, St. 
Louis, MO) . 

b. Cosmid A2: DNA (1 \xl of 20 ^1 total in the ligation) 
was transformed into E. coii DHIOB by electroporation and cells 
were spread onto LB agar medium containing 50 mg/ml ampicillin. 
After growth for 18 hr at 21°C. transf ormants were transferred 
to 96-well growth blocks and shaken overnight at 37°C in 1.3 ml 
2x YT medium with 50 mg/ml ampicillin. Templates were prepared 
from these cells by an alkaline lysis procedure (Qiagen Qia- 
quick Turbo Prep) to yield purified, double -stranded plasmid 
DNA. Cycle-sequencing of the plasmid templates was performed 
using universal forward and reverse primers and Big Dye Termi- 
nator sequencing kits (FEBiosystems) or Thermo Sequenase fluo- 
rescent labelled primer cycle sequencing kiz (Amersham Pharma- 
cia Biotech) using the manufacturer * s protocols. In the shotgun 
phase of a cosmid, identical amounts of samples were sequenced 
either by dye-primer or dye- terminator chemistries (Pharmacia, 
PE Biosystems) . Data were collected using Licor and ABI 377 
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automaced sequencers and assembled with the GAP4 program (Bon- 
field, Smith, Staden, Nucl. Acids Res. 23, 4992-4999 [1995]). 
Gaps were closed using custom made primers {MWG- Biotech) on 
plasmid templates or PGR products in corr±>inat ion with dye- 
terminators . 

E. Bioinf ormatic Methods 

1. Open reading frame (ORF) identification 

ORFs were identified m the pEP0cos6 region using the OMIGA 
1.1.2 (GCG 0.4D) program from Oxford Molecular Limited. Default 
values were used (Standard genetic code, all ORFs over 50 
bases) to generate ORFs; analysis of these results lead to the 
list of 14 highest qualicy ORFs as defined in claim 9. Other 
ORFs, genes, or genetic elements may be found in the pEP0cos6 
insert that have not yet been annotated. In addition to hand- 
editing of the OMlGA-generated data, the MAGPIE automated ge- 
nome analysis tool: 

(http : / /oenomea . rockefe l ler . edu/macroie /magpie . html ) 

was used to identify genes for all the sequenced cosmids and 
plasmids. ORFs identified in this manner are presented as both 
nucleotide and peptide files below. 

For cosmids A2 and A5, ORFs have been identified within 
the DNA sequences of A5 (contigs 10, 11, 12) and of A2 using 
the FramePlot analysis program from Ishikawa and Hotta (FEMS 
Microbiol. Lett., 174, 251-253 [1999] public available, under 
[http://www.nih.go.jp/-jun/cgi-bin/frameplot.pl] v;hich is based 
on positional base preference in codons typical for organisms 
having genomes with a high G + C content (Bibb et al . , Gene 30, 
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157-166 11984]). Default parameters using ATG and GTG as start 
codons were used. The deduced amino acid sequence of predicted 
ORFs were compared with protein databases (GenBank, CDS trans- 
lations, PDB, SwissProt. PIR, PRF) using BLASTP (Altschul et 
al., Nucleic Acids Res., 25, 338S-3402 [1997]). Additionally, 
high scoring amino acid sequences were analyzed using the Pram 
program [http://www.3angsr.ac.uJc/Software/Pfam/], which identi- 
fied specific domain s-ructures of the submitted proteins 
(Bateman et al . Nucleic Acids Res., 27, 260-262 [1999]). 

2 . BLAST searches 

3LASTP2 similarity searches were performed using the peptide 
files from the above OR? identification strategy as query se- 
quences. Searches were performed using the in-house Bioinfor- 
matics BLASTP2 (Version: BLASTP 2 . 0al9MP-WashU) web page at the 
Bristol-Myers Squibb Pharmaceutical Research Institute (allows 
BlastN2, BlastP2, BlastX2 , TblastN, and TBlastX searches). In 
addition, peptide files generated by the MAGPIE analysis were 
automatically searched using a FASTA algorithm. 

3. Best match and probeible identification 

Analysis of the BLASTP2 and FASTA output led to an assign- 
ment of a best match and probable function. The best match was 
usually the top scoring r.atch, although sometimes another match 
was given because it was a more relevant homolog, or no match 
was found with a significance greater than >e-4. Probable func- 
ticr. represents the besz estimate of function given the initial 
analysis of the BLAST data and the published literature regard- 
ing Che best match, and may not necessarily represent the true 
func-ion. of the gene product (hypothetical proteins are of un- 
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known function) . A higher probability score indicates a higher 
liklihood that the probable function corresponds to that of the 
best match; e.g., the polyketide synthase matches are all above 
e-100, arid given the very high significance scores are presumed 
to function as polyketide synthases (as are the high scoring 
peptide synthetases) . 

The following is a summary of the sequence data from the 
pEP0cos6 region, pEPOcosS, AS, Sau4 and A2 . 

a. Data from pEPOcosS region; 

Summary: A large PKS/PS cluster spanning multiple cosmids. 
An IS element (designated IS-Scl here) is found in the cluster 
- this may be a- potential tool for genetic analysis of Soran- 
gium. 

Statistics: Sequence was assembled from over 2000 random 
sequences (forward and reverse reads of the ca . 2 kb cloned 
fragments derived) . 

4 7,713 nucleotides of contiguous sequence (no pFD666 vec- 
tor included) 

DNA sequence data are as defined in claim 7. 

Note: .pEPOcos6_ORF7 sequences (cf. claim 9): the predicted 
N-terminus .of 0RF7 shows 145 nucleotide overlap with 0Rr6 . 

Note: pEPOcos6_ORF8 sequences (cf . claim 9) : >pEPO- 
cos6_ORF8.seq ("0RF9_up" in Fig. 2) 
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57.3% G+C 

Table 3 shows ORF data summary. Note: pEPOcos6_OR?i , seq is 
truncated at its 5' end; correspondingly pEPOcos6_ORFl . pep is 
truncated at its N-terminus. 

b. Data from pEPOcosd region: 

Summary: Two PKS genes found on a cosmid. A TnlOOO inser- 
tion is also found (occurred during £. coli propagation) . No 
peptide synthetase genes were found; one P450 hydroxylase was 
identified. 

Statistics: 1952 random sequence reads from the pEPOcosS 
library were assembled using phrap, with 1024 of the sequences 
assembling into 57 contigs. 12 of these cpntigs were chosen 
(totaling 56,537 bp) which each contained >6 reads and con- 
sisted of about lOOO bp or more. The sequences of these 12 
contigs and the associated ORFs are given below. 

DNA sequence data from contigs are as definded in claim 
10. Table 4 shows more data. 

pEPOcosS protein data are as defined in claim 11, i.e. for 
selected ORFs (polyketide synthase, peptide synthetases, or 
ORFs with high similarity to known genes) . 
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c. Data from cosmid A5 insert: 

Summary: A cluster zz PKS and PS genes found on the cos- 
mid. Other genes possibly involved in this secondary metabolite 
production include a dov/nstream lipoxygenase gene higly similar 
to eukaryotic orthologs. 

Statistics: 880 random sequence reads from the A5 library 
were assembled using phrap, with 530 of the sequences assem- 
bling into 12 concigs. 3 cf these concigs were chosen (totaling 
41,556 bp) which each ccntained >100 reads and consisted of 
about 9000 bp or more. Tr.e sequences of these 3 contigs and the 
associated ORFs are giver, below. 

DNA sequence data from contigs are as defined in claim 12. 
Table 5 shows more data. 

Protein sequence data from selected A5 ORFs are as defined 
in claim 13 . 

d. Data from plasxoid Sau4 insert: 

Summary: Insert contains PKS genes on two large contigs - 
most similar to the soraphen PKS gene from Sorangium. 

Statistics: 565 random sequence reads from the Sau4 li- 
brary were assembled usir.g phrap, with 84 of the sequences as- 
sembling into 18 contigs. 2 of these contigs were chosen (to- 
taling 6596 bp) which each contained >10 reads and consisted of 
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about 1000 bp or more. The sequences of these 2 contigs and 
the associated ORFs are given below. 

DNA sequence data from plasmid Sau4 contigs are as defined 
in claim 14. Table 6 shows more data. 

Protein sequence data from selected plasmid Sau4 ORFs are 
as defined in claim 15. 

e. Data from cosmid A2 

Table 7 shows ORF data summary 

F. Construction of suitSLble recombinant expression vectors 

1. Expression in Myxobacteria 

Heterologous expression of the ORFs shown in Figure 1 is 
performed by using a derivative of plasmid pSUP102 (Simon, R. , 
Priefer, U. , Puhler, A., Methods in Enzymology (1986), vol. 
118, pp. 643-659). In this plasmid the gene for chloramphenicol 
resistance is changed for a cassette comprising the gene for 
streptomycin . resistance and the promoter element of the TnS 
transposon. Short homologous genomic DNA segments from the host 
organism are ligated with the DNA sequences of Figure 1 and 
with efficient regulatory elements into, for example, the EcoRI 
restriction site cf the vector. Following amplifiction of the 
vectors in Escherichia coli the DNA is transferee by electropo- 
ration of the host cells or by conjugation with Escherichia 
coli S17-r (Simon, R. , Priefer, U. , Puhler, A., Biotechnology 
(1S33;, vol. 1, pp. 784-791). 
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By means of the Ce-racycline or strepcomycin resistance, 
respeczively, mediated by the vector the host cells are checked 
for integration of recombinant plasmid DNA into the chromosome 
by homologous recombination. 

2 . Expression in Streptomyces cells 

Heterologous expression of the ORFs shown in Figure 1 is 
performed by using bifunctional Strepomyces-Escherichia coii 
cosmids pKU206 and pOJ466. 

3. Expression in Escherichia coli cells 

Heterologous expression of the ORFs shown in Figure 1 is 
performed by using "bacterial artificial chromosomes", cosmids 
(for example Supercos, Stratagene GmbH, Heidelberg) and T7 ex- 
pression systems (Stratagene GmbH, Heidelberg; New England Bio- 
labs Schwalbach, FRG) . Expression of recombinant enzymes occurs 
in Escherichia coli cells constitutively expressing phosphopan- 
tetheinyl transferase required for the formation of holoenzyme 
polyketide synthetases and polypeptide synthetases. 
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Table: 5. A5 &0 3QiBbly axiaJ.y03.s snTrnnary (continued) 
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Table: 7. ORF data atanaary from A2 insert 
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Claims 



1. DNA sequence, the expression products of which cause an 
enzymatic biosynthesis, a mucasynthesis or a partial synthesis 
of polyketide or heteropolyketide compounds or are involved 
therein. 

2. DNA sequence according to claim l, wherein the polyketide 
or heteropolyketide compounds are epothilones. 

3. DNA sequence according to any of the preceding claims, 
wherein the DNA is derived from myxobacteria . 

4. DNA sequence according to any of the preceding claims, 
wherein the DNA is derived from Sorangium strains. 

5. DNA sequence according to any of the preceding claims, 
wherein the DNA is derived from Sorangium cellulosum. 

6. DNA sequence according to any of the preceding claims, 
wherein the DNA is selected from the group consisting of: 

(a) the following DNA sequence: 

Seq ID No 1 (A2 cosmid) 

GGATCGCGGCGCCCTCGCGCTGCTCCTCGAGCGTGCGGAGGAACTCCCACGCCAGGCGCGACT 
TGCCGAGGCCAGGCGCGCCCACCACCACCACCGCGTTCGCGGAGGGCTCGTCGACGCAATGGC 
GCCACTCGGTCGCGAGCTGCGAGAGCTCGCGCTCCCGCCCCACGCAGGGCGTCGGCTTGCCGA 
GGAGCCGTGGGACGGCATCCGGCTCCTCCTTCGGGCCGCGAAGCCAGCACCCTCCGGGCCCCT 
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GTACCGTCTCGAAGCGGCTCGCGAGCAGGCTGGCCGTCGCGTCGTCGAGCCGGATCTCCGGCG 
GCGACAGGCCATCTCGCCCGGCGATGAGCTGCGCGACCCGATCGACCAGCTCGCCGACCGGCA 
GCCTCGCCTCGACCTCGGCCAGCCCTGTCGCGACGGACACGGGCACGCCTCCGAGCGCCGCCC 
GCAGCGCGAGGGCGCAGTGGGCCGCCCGTGTGGCGAGATCCGTGGGCGACTCGGCGCCGGACA 
GCGCGACGAGCCACCAGCGCGCTTGCAGCCGATCGAGGCGCCCGCCGTGGCGCGCCGCGATGT 
CCCGCAGCGCCTCGGCCCGCGCGGCGCCGTCGTCCTCCGAGAGCGTGGCGCCGGCCTCGGCGC 
CGCCGTCTTCGGCCAGGATGACGCACATCACCTTGCGCTCGGCCGTCGTGATCGCCTCGCCCG 

gcgcggccggcgccgcgaccgcgctcgccccgatcgagagccc::tcgccggccacggcggcga 

GCTCCGCCGCGGCGGCGGCGCCGTCGCGCGGCCGCTCTCCCGCGTTCTTCGCCAGCATCCGCG 

ccaccaggcgctcgagcggctccgggataccgtcgcggagctccccgagccgcggcggctctt 

ccaggacgacccgcatcaggagcgcgagcgcgctgttgccgaggaacggcgggcgccccgcga 

ggcactggaacagcacgcacccgagcgcgaacacgtcggcccgggcgtcgaccggcgcgtcgc 

cgcgcacctgctcgggcgctatgtacccgggcgtgccgagcacggccccgggcgacgtgaggg 

tcggcgcgagccggaggtggcgcgcgatgccgaagtcgagcagcgtgacgcgctcgaccgcgc 

cgcccacgagcatcaggttgctcggcttgaggtcgcggtgaacgacgccgagccagtggatcg 

cgccgagcgtcgtggccacgcgcgcggccagcgccacgctctcggccagcgtgagcggcgccc 

cggcgagccgctcctccagggtcacgccgtcgagccactccatggccaggtacggccgccctg 

cgccggtcaccccgtgcgccacgtactgcaccacgccgggcagccggagcgtcacgagcgcct 

ccgcctcccgcgcgaaccggcgcaggtcgttggcgctcgcgccctgcaagaccttgagcgcga 

ccgcctgcccggacacccggtcgcgcgcccggtacacgtcccccatcccgccggagacggcga 

gccgctcgatctcgaaacgatcctcgatcacatccgctgcgcgcatggcggtgccaatgtact 

ccgcgcgagcctcgggcccccgcgcgtaagtgcggccctgcgcccggttgaacgccagcccga 

gcgtgaccgcctcgcgctcgggatccacggccgccggatcggtccacgcctcgacgagcgcct 

gcgttgaacaacccgccaccgggcgcacgcagccggcatcgccgcgctggccacccggcgctg 

ccgcccttaggctcacctccgcgatgccccgctggttcaacacggcaggtccctgcaacccgg 

ccgatcactacatgctcccggccgaggagcgcttgcccgcagtgcgcgatctggtcgatcgca 

aggcctacttcgtcctgcacgccccgcggcagatcggcaagacgacctcgctgcgcacgctcg 

cccaggatctcacggccgaagggcgctacgtggccgtcctcgtctcggcggaggtcggcgccc 

ccttctctgacgatcccggcgcggccgagctcgcgatgctcgcagaatggcgcggcaccgccg 

gcgcgcagctccccgccgatctgcggccgccaccgttccccgatgcgcccgccggtcagcgca 
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TCGGGGCCGCCCTGCGCGCCTGGGCTCAGGCCGCGCCGCGCCCGCTCGTCGTCTTCCTCGACG 

AGGCCGACGCCCTGCGCGACGCGACGCTCGTCTCCCTATTGCGCCAGATCCGCAGCGGCTATC 

CCGACCGCCCGCGTGACTTCCCGCACGCGCTCGCCCTCGTCGGCCTGCGCGACGTGCGCGACT 

ACAAGGTCGCGTCGGTCGACAGCGGCAGGCTCGGGACGTCGAGCCCCTTCAACATCAAGGTCG 

AGTCGCTCACGCTGCGCAACTTCACCCGCGACGAGGTCGCAACACTCTACGCTCAGCACACGG 

CCGAGACCGGTCAGGTCTTCCGGCCGGACGCCGTGGACCGCGCCTTCGAGCTCACCCAGGGCC 

AGCCGTGGCTCGCCAACGCGCTCGCCCGCCAGCTCGTCGAGGTCCTCGTCAAGGACCGCGCGC 

AACCCATCACGTCTGCGAACGTCGATCGCGCCAAGGAAATCCTCATCGAGCGGCAGGACACAC 

ACCTCGACAGCCTGGTGGATCGGCTGCGCGAGCCGCGCATCCGCGCGGTGATCGAGCCGATGC 

TCGCCGGCACCGCGTTGCCGAGCGTGCCCCCCGACGACCTTCGTTTCGCGATCGACCTCGGCC 

TCGTGCGCATGACCGCGGAGGGCG3CCTCGACGTCGCCAACCCCATCTACCGCGAGATCATCG 

TCCGCGAGCTCGCGTTCCCGATCCGCGCCTCACTCCCCCAGATCAAGGCCACGTGGCTCACGC 

AGGACGGCCGCCTCGACGCGGACCGCCTGCTCGACGCCTTCCTCTCCTTCTGGCGCCAGCACG 

GCGAGCCGCTCCTCGGCGCCGCGCCCTACCATGAGATCGCCCCGCACCTCGTGGTGATGGCCT 

TCCTCCACCGCGTGGTGAACGGCGGTGGCACCGTCGAGCGCGAGTACGCCATCGGCCGGGGCA 

GGATGGATCTCTGCGTTCGTTACGCGGGCGAGACGCTCGCGATCGAGCTCAAGGTCTGGCGAG 

ACGGCCGCCCCGATCCCGTCGCCGAGGGGCTCGCCCAGCTCGACGAGTACCTGGCCGGCCTGG 

GCCTCGATCGCGGATGGCTCATCCTCTTCGACCAGCGCTCCGGACAGCCCCCCATCGCCGAGC 

GCACGCGCCGCGAGCGCGCGCTCTCCCCCGCCGGCCGCGAGGTCGCCGTCATTCGCGCCTGAG 

GGAGCTCGCCGCGCGGCGAGCGCCCTCCACGAGGGCCGGGCCACCTCGGACAGCGTCTCTACT 

CCTCCGAGGCCGCCGCGCCCCCCGCCCCGGCCGCCGCCGCCGCCGCCGGCTCCAGCTCGCAGC 

GCACCACCAGGACCTCGCCATCCGCGAGCTCCGGCCGCTCCACGAGCGCGTGCGCGCCCGCGC 

GCACCGCCGTGAGCACGTCTCCCAGCGCCGGCTTCAGCCGCGCCAGCGTCGCGGCGTTCGCCC 

CGAGCGCGAGGTCGGTCACGACGGGCCCCACGCTCGCGCCGAGCTCGCTCTTGCGCTTGTTGA 

CCGCCGCCATCGCCGCCGCCGCCAC-ATCCAGGAGCCCCGGATCCGAAGGCGCCGCGACCGCCG 

CGAAATCCGCCGCTGAAGGCCACTTCGCCCGGTGGATCGAGGTATCGCCCGTCTCCTCCGCGT 

ACACCCAGCGCCAGACCTCGTCGGTGATGTACGGCAGGACCGGCGCGAACAGCCGCAGCAGCA 

CCGACAGCCCGAGCCGCAGCGCCGCCACCGCCGAGCCGCGCGCCGCCTCCCCGGCGCCGCCCT 

CGCCGCGCGCCCGCGCCTTCGCGA3CTCCAGGTAGGCGTCGGTGAACCAGCGCCAGAAGAAAT 

CCTCGGTCCGCTCGAGCGCCGCCGCGAACTCGTGCTCGTCGAACGAGCGCGTCGCGTCGTCCA 



wo 00/22139 



PCT/US99/23535 



48 



CCACGGCCGACAGCTTGTGCAAGAGCGCCCGGTCGAGCTCCTCGGAGATCGGGTGGACCTCCG 

CCGACTGGCTGAGCACGTACTTGCTCGCGTTCCAGATCTTCGTGACGAGCCGCTTGCCGATCT 

TCAGCACCTTCTCGTCGAACGCCGTGTCCGTGCCGAGCCGCGCGCTCGCCGACCAGTAGCGGA 

CCGCGTCCGAAGAATACGTGTCGAGCAGGTGCATCGGCGTGACGACGTTGCCCTTGCTCTTCG 

ACATCTTCTTGCGATCCGGATCGAGGATCCACCCGGAGATCGCGACGTGGTGCCACGGGACCG 

ACGACTCGTGCAGCATCGCCTTCGCGATCGTGTAGAACGCCCACGTCCTGATGATGTCGTGGG 

CCTGCGGCCGCAGATCGGCCGGGAAGAGCCGCGCGTGGCGCGCCGGATCGTCCCCCCAGTGAG 

AGCTGATCTGCGGCGTGAGCGAGCTCGTGAACCACGTG7CGAAGACGTCGGACTCGGCGGTGA 

AGCCGCCGGGCTGGTCCCGCTGCGACGCCTCGTACCCGGGCGGCACGTCGACCGTCGGGTCGA 

CCGGGAGCATCTCGCGCGTCGCGAGCAGCGGCCGGCTGTGATCCGGGTTGCCCTCGGCGTCGA 

GCGGATACCAGACCGGGAACTGCACGCCGAAATACCGCTGGCGGCTGATGCACCAGTCACCCT 

GGAGCCCCTCGGTCCAGTTGCGGTACCGGAGGCGCATGAAATCCGGGTGCCACTTGATCTTGT 

CGCCGTATTCGAGGAGCTCGGCCTTCTTGTCGGCGAGCCGGACGAACCACTGCCGCGTGGGCA 

CGAACTCGAGCGGCTGGTCGCCCCGCTCGTAGAACTTCACCGCGCGCTCGATCGGCCTCGGCT 

CGCCCCGCAGCGCCGGCCCCCGGCCGGGCGCCGCCGCGTGCTCCTCGCGGCGGAGCAGCTCGA 

CCACCGCCGCGCGCGCCTGCTTCACCCCCCTGCCCTGGAGCGGCGCATACGCGGCGTTGGCCG 

CGGCCGGGTCGCGGCTCTCCCACGCGCCCTCGCCGAACGTCACCGGCAGGACACGGCCGTTCT 

TGCCGAGCATCTGCCGGAGCGGGAGCTTCTGCTCCCGCCACCAGATCACGTCGGTCGCGTCGC 

CGAAGGTACAGACCATCAGGATGCCCGTGCCCTTCTCGCGATCCACGAGCGGGCTCGGGAAGA 

TCG3CACCGGCGCGCGGAAGATCGGGGTGAGCGCCGTCTTGCCGAAGAGGTGCTGATACCGCG 

GGTCCTCCGGGTGCGCCGTGACGCCGACGCAGGCCGCGAGCAGCTCCGGGCGCGTCGTGGCGA 

TGACGAGCTCCTCGGCCGTCCCCTCCACCGCGAACGCGATGTCGTGGAACGCGCCCGATTGCG 

GGCGATCCTCGACCTCCGCCTGGGCGACCGCGGTCTGGAAATCGACGTCCCACATCGTCGGCG 

CGAAGACCGAGTAGAGGTGGCCCTTCTCGTGGAGATCCAGGAACGACAGCTGCGCCGTCCTGC 

GGCAGTGATCATCGATGGTGGCGTACTCGTTCCGCCAGTCGACCGAGAGGCCCACCCGGCGGA 

AGAGCGCCTTGAAGACCTGCTCGTCCTCGCGCGTGACCTTGTGGCAGAGCTCGATGAAGTTGG 

GCCGCGACACGATGCGCGGCGGCTCCTTCTTGATCGTCTCCGGCGCGGCCTGCGGCAAGGTCA 

GGCC3CGCTCGTACGGCGTGCGCACGTCGGTGCGGACGTGGAAGTAGTTCTGCACGCGCCGCT 

CGG7GGGCAGGCCGTTGTCGTCCCAGCCCATCGGGTAGAAGATGTTGAAGCCGCGCATCCGGC 

GCTGGCGGACGACGACGTCCGTGTGCGTGTAGCTGAAGACGTGGCCGATGTGCAGCGAGCCCG 
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AGGCGGTCGGCGGCGGGGTGTCGACGACGAAGGTCTCCTCGCGGGGGCGCGACGGGTCGTATC 

GGTACGTCCCGTCGGCCTCCCACAGGTCGGCCAGGCGCAGCTCGGCGACGGGCGAGTCGAAGT 

GCTTCGCGAGCGTCGCGGGATCGATGGAGCGGAACGTCTTCTTGATCGTCACGTGGTCACCTG 

CAGAACAGACCCCGCAGGAACCGCCCGCGGGGCCGGCATCCTACGTCGTCCCCCGGGTGCCGC 

TCAAGGCGCGCCGCGCCCGCGCGGCGGCGATCCGCGATCGCATCCGCGCATCCGCCAGAGCCC 

GGCGGCTCCGCCGGCGCGCGCGCGCCGTCCGTGGAGCCGAGAGGAGAGGCCGGCGCCCAGGTC 

GTGGAGGACGCCGGCGGCGCCGCCGCGGAGATCGCGGAGAGGCGGGCGCATCGATCGCGGCGA 

GGCCGGGGGCTCAGTCGTAGCGCTCGACGTGGACGTGCTTGCGGTGGACGCCGAGCTCGCCGC 

GGGCGAGCTCGCGGACGGACGAGACCATCCGATCCAGGCCGCAGATGAAGACGTGCGGCGCCG 

GATCTCCGCTCTTCTCCGCGAGCTCCCGGTAGAGCTCGGGCACGTGCGCCTGCACGTAGCCGC 

GGCGGCCGGCCCACGACGGGCCGCCGCGCGAGAGCGTGATCTCGTAGCGGATCCGGTCGGATC 

CGCGCGCGAGCGCCTCGAGCTCGTCGCGGTAGATGACGTCCTCCTCGAAGCGCGCGCCGAACA 

GGATCCACAGGTGGGGCGCGGCCAGCCCCGCGCGCAGGGAGGCGCGCAGCATGCTCCGGAGCG 

GCGTGATGCCGGTGCCGGTCGCGACGAACAAGGAGGGCGCGGAATCCCCGGGATCGCGGGTGA 

AGAGCCCGTGCGGGCCGATGGCGCGGAGCGTGGCGCCGGGCTCGAGCCGGTGCAGGTGCTCCG 

AGCCCGCCCCGCCCTGCACGAGCGTGACCGCGAGATCGAAGCGGGGCGAGCCGTCGGGCGCGG 

ATGCGATGGAGTAGGCGCGCTTCACCTCGCCGCCCGGGAGCGGGAGGACGAGGTTGACCCACT 

GGCCCGCCTCGAACAGAAACGACCTCCCGTCGGCGCGCTCGAACGAGAGCTCGCGCACGAAAG 

GGCTGAGGGGCCGGGCGGCGACGAGGCGGGCTTCGAACGGTTCGGCGTGGATCATGGTCGGGG 

CCCGGCGGGGCTCGGCTGCGAGGCCGCGCGGGTGGCGAGGTCTTACCGCAGCCTGCGCCCCGG 

CCCAATCGCGATCGCCGCGGGAAGGGCGCCGCCGGAGGGCGCGCAATCGCGGGAATCACGGGC 

TTCCGCCCCGTGCGCCGCCGGAGCGCGCGGCCGGGCCGCCGGCCCGCGCTCCGGCGGGGAGCC 

GTCGCGGGCTCTACCGCACGCCCATGCGGCGGCGCTGCGGGATGTTCACCGCCGGCCGGGAGC 

GATCCTGGTTGGGGAGCGCGCGCGGCGGGCGGGGATCCCGGTGCGCGGGCTTCTGCGCGGGGA 

GCTGCCCTCGCTGAGCCGGGCGCTGGTCGGGCGACTTGGCCGAGCCCAGCGCGAGATCGGAGA 

CGGGGAGATGCGCGCGTCGCTGCATAGAATCCTCCATGGAATCGGTCATCAACACATCGGGAA 

GAGCACCCAGGCTGAAAGAAACCTTCGAAGAACCGGCTCTCATACACCCTCCATTCATCGTGC 

GACCCCGGATTCAGGACGGATCGAACCCGCGAGGGACGCTGGCTCTCTGGGCCTCTCCCTGCT 

CGCTCGACCGGCGCCCTCTCGACGCAACTCCGCCGTTCGTCGGGACGGGACGGTCCGCCTCGC 

CGCACGCTCCCCGTCGAGACGACTCAGCGTCTCGACGTCAGGAGAGATGACGACTCGGCCCGT 
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CGCGCCACGACCCTTCCGGCTCGGT3CTTCGAGCGCGCGGCCAGCGAGCGAGGGGCGATCGCC 

AGGAGATCACGAATCTCCCGGCCATCGGCCTCCAGCGCCTCGGGCTCGTTCGCTCGTCGCCCC 

GCTCCGTCCCCGCGCGCGCACGACGCGAGCTCGCGCGGGGAACCGCGGGCCGCTGTCGTGGCT 

GCTGATGCGCGACGATACAGGGGGGACGCCGTGCCTACCTGGGCAACAGGCGCTCATCTTCTA 

CCACGGCGAGCACTACGGTGAGTGCTGCCATGAGTAGGCCCCTGAGGGTCCGCGCGACGGAGC 

GTGGTGTCAGCGAGAGATGCGCATGGTGGACGCGGGCTACGCGTCGAGAGGGACACTAGCACT 

CGACCTCGATCCTGCCCAGCACTTTTTGTCGGGGAGGGCTGCCCTCCCGCTGGCCGCTGGCCG 

CTGGCCGCTCGCCGCTGGCCGCTCGCCGCTGGCCGCTGGCCGCTCGCCGCTGGCCGCTGGCCA 

TGTGCGACGTGAGCTCGAGCAGCCCGCGGCTGACGGACAGACCCCGGAGTTCATCGAGCCGGT 

GATGCCGAACCCGCCAAGCGAAAAAACGTATCCGTTCGGCAGGTCGTGGCCTATCATGCAAGC 

TGCTCGATGCGCTGACAGGCTTCTTCGAGATCCTCGTCGGTCTTTGCGAAGCAAAACCGCATG 

AAGCGACTCCCCTGCGTCCCTTCAAAGAAGGCGTCGCCTGGCACGCCCGCCACCCCGGTCTCG 

TCGAGCAAGTAGATGGCTCGCTCTCGACCTGTCCTCCCGGGTAGGCGAGACACATCCGCCAGC 

ACGTAGTACGTCCCCTGCGGCACGCAGGGTGGCAAGCCCGCTTTCTCCAGCGCCCGACAGAAC 

CGGTCTCGCTTCCGTTCATATCCCTGGGCAAGCCCCGTGTAAAACGAGCGAGGAAGGCCGCGG 

ATCCCGGCAGCGACTCCATGCTGCAGCGGCGTCGGCGCGCAGACATACAGCAGGTCGCTCATG 

GCTCCAATGGCCTTCGCCCACCTGGCATCGGCCACGCTGTAGCCGATCCGCCATCCTGTGATG 

CTGAAGGTCTTCGAGTAGCCGCCTATCGTGATCGTACGCTCGGACATGCGCGGAAGGGAGGCG 

ACGCTGACGTGCTCACGGCCGTCGAAGATAAAGTACTCGTAAATTTCGTCCGTGATCACCATG 

AGGTCATGGTGGCAGGCGAGATCGGCGATCTGTTCCAGCTCCATTCGGCCGAACACCTTCCCG 

GAAGGATTTCCAGGAGAGTTCACCACGATCGCCTTGGTCTTCGGGGTGATCGCGCGCTCCAGC 

TCGTCGCCGTCGACATTCCAGCTCAGGGATCGCGCCGTCACATACCGCGGAACAGCCTCGACG 

GCGAGGATAGCCTGGGCGTGATAGGCATAAAACGGCTCGAAGAGCAGCACTTCGTCCCCAGGA 

TTGAGCAAGGCCATGCAAGTGGCCTGAAAGGCCCCTGTCGCTCCGGCGCTCACCGTGATGTCA 

GTCTCCGGATCCGCCGCGATGCCATTATGGCGAGCCAGCTTCGCCGCGATCGCATGGCGCAGC 

TCCACGATGCCGTCGAAGCGCGAATATGTATTGCACCCCCGATCCATCGCCTCCTTCACCGCT 

TGAAGGATCACCGAAGGAACTGGGGTATCACAGACGCCCTGGGACATATTGATCCCATGGACC 

TTGGCGCACGCCAGGGTCATGGTAC3GATATCGGACTGGGCGAGGCGAGCCGCACGATCACTC 

GGTAGACTCTTCATCAGCGTGCTCCrGCTTCTGTTCTGCGGCTCTGCATGGTGTCTTCGGGTG 

GGCTTGTCAGCTCGACGCGCCCATGCAGCGGCGCAGCCCTAGCGGCCGCAGGTCTGTCCACAC 
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TTCTTTGATGAAAGCGAGACATTCGGCTTTCGTGCCCTGTTTGCCCGCAGCCCTCCAGCCCCC 

AGGTACGGGCTTGTCGGCGGGCCAGATCGAGTACTGCTCTTCGCCGTTCACCACGACCTGGCA 

ACGCGTCTTGCTTTCGTCGTCCCGATTCATGATTTTCCTCGCCCTTCGTCAGCGCTGCGCGAG 

CATGAAACGAATCGCTCATCGGCGCACAGGCGCGCGCCGGCTGCCCGGAGGCACTCCCACGCC 

TCCCTCACGGCAACCTCATCGCTCCGGATGTTCCCGATGGCGACTCGGATCGTGTACCTGCCG 

TGGAGACGGGTATGGGACAAAAATACCCTGCCCGACTTGTTGACCTCGTCCAGCAGCGCCTCG 

TTGAGGCGATCGAGCTCGCGTTCGA7CGACTCTCTCTCTGCCTCGTCCGCCGACCGCATGATG 

CAAGCGAGCGCGGAGGGCCTCATGCGAAAGCAGACCGTACTGAACGGCGTCGGCGCGAGGCGC 

TCCCAATCGGGATCGGCGTCCACCCACTGGGCGAGCTGCTGCCCCAATCGGAGGTGCTCCCGG 

ATCCGGGCCGCCAGCCCTTCATGCCCGAAGTAGCGCACGATCATCCAGAGCTTCAGCGCTCGG 

AAGCGCCGACCGAGCTGGATACCCCAGTCCATGTAATTCGTGACGTCGCCCTCGGTGCGGAGG 

TATTCGGGCACCAGACTGAACGCGCGCTTCAGTCGGTCGGCGTCACGCACGTAGAGCACGCTG 

CAATCCATGGGGGTGAACAGCCACTTGTGAGGGTTCACTACCAGCGAGTCCGCCCCCTCGCAG 

CCCGCGAGCACGTCCCTGTGCTCGGGGACGATCGCGGCCATCCCCGCGTAGGCCGCGTCCACG 

TGAAGCCATAGCCCGTGCTCCCGGCAAACGCTGACGATGGCGGGGATGGGGTCGACGCTCGTC 

GTGGACGTCGTGCCCACCGTCGCCGCGACGCAGAAGGGTCGGAGGCCGGCCCCGAGGTCCTCC 

ACGACGGCGGCGCGCAGCGCCTCGGGGACCATGCGGAAGGCCGGATCCGTGGGGATCTTCCGC 

ACCCCCTCCTGCCCGATGCCGAGGGTGATGGCTGCCTTCTCGATGGATGAGTGCGCCTGCTCC 

GACGCGTAGAGTCGCATGCGCCGCrGTCCCGCCATGCCCCGGAGCCGGATGGTCGGCTCGGCC 

GAGTCGCGCGCGGCCGCGATCGCGACCATGCTGGCGGTCGACGCGGTGTCCATGATCGCGCCG 

TGCAAGCCGGCGTCGAGATCCAGCATCTGACGCAGCCAGGAGAGGACGAGCTCCTCGAGCTCG 

GTGGCCGCCGGCGACGTGCGCCATAGCATCACGTTGACGTTGAGGCACGCCGCGAGCAGCTCG 

CCGAGGATCCCAGGACCAGACGCCGTGTTCGCGAAATACGCGAAGAATCGCGGATGATTCCAG 

TGCGTGATCCCCGGCAGAATGATCTGCTCGAAATCGGTGAGCACGGCGTCCATCGGCTCCGGC 

TCGACGGGCGGGGTCGGGGCCAGCCTGCCCTTCACGTCGCCGGGGCGGATCGCGGGAAAGACG 

GGGTATCGATCCGGGTGGCCGAGGTAATCGGCCGCCCAATCGATGATTCTCATACCGATCCGG 

CGGAACTCCTCCAGATCCATGTCCCCGAGCCGTTCTTTCCGCGGGTCGCTCACGTCAACCTCC 

TCGCCCTGCCAGGACAGGATCCTCGAGGTCCCCTGGCTCCGGCGGTGGAAAGCGCTCCTTGAA 

CGTGAAGGCCCACGGGGTCGGTCCGTAGCGCCGCAGGTGCTCGAGCCGATCCTGCCCCTCGCG 

GACG3ACGGGATGTGCCCGGCCGGGACCCACCACAGCACGAGGTAATGCGGCTCGAGATGCTC 
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GAACCACCGAGCGCGCTGTCGCAGGAACGCGGCATGATCCGCGGTGTAGGTGAAGGCGAACAG 

GTGCTCGATGGAGGTCCATACCGACAGGGTCACGAGGAGCCGCTGGTCCGGGTACGGACGGAT 

GGACACAGAGTTCCCCTCGGCGGTCTGCAGGCGCCACACGAACCCCTCGCTCCGATCGGCCAG 

ATGGTTGATATGGTCGAGCCCCTGGACGAAGCCCTCCATGATCGGATCCTCCAGCGGAGCGCG 

AATACATGCGAAGTTGTATTGCGCGATGTGGTGCCGATGCTCCGACATGTCGCTTTCCATCTC 

CAGCTCCCGCTCACCAATCCCAGCGCTGCTCCGGGGAGCTCATCAGGGCAGACGCGACATCGA 

TCCCGAAGCTCCGCCGCATCCCCTCGACGAAGGCGGCCTGGACCGCTTCGGCGACGGATCGGC 

CTGCCTCCGGCAAGACCTCGGAGACAAAGAAGAACCGCCTCGTGGAAGGGACAATCTTGCCCC 

GCTCCGCCTGGCGCCATACGAAGTGCCTCGTCACCAGTCCCTCCGCGTCGGCATACCCGACCT 

CGCCGGCGCCGACCGCCACGCTGCCGCCTGAGCCGAGCTCCACGAACGCCTCACCGCCTCGCG 

AGATCTCGAGGCGAACGTCCGGGCCAGCCAGATCGCCGAGGTCCCAAGCGCCGACGGGGACGG 

CGAACCGCAGCGACAGGAGGTTGTAAAAATCGACGAATGCGTTGATGTGCGGCAGCTCTCCAC 

CACCGAGGACCCGCTTCGCCAGCGCCTCGATCGAGCTCGGAAATTTCTTGCCAGAGACCCCCA 

CTCGCTTCATCGCCTCGCGCCAGGCAGCCACGTGCGGATGCGACTGGGCGTTTTCGTGGCCCC 

AGCTCCGTCGCAGCTCCTCCTCGACCTTCCGGAGCTCCTCCAGCACGGCCGGCCGCTCTGCGG 

CGTTGTCCAGGCCTTCCCCGTACCCGGTGACCAAGATCATCCCAGGAAACGACTCCCAGATTC 

GCGGATCGACGATGAATGCCATGTGCCTCCTGCCCCTCGAGAGCGATCGCCTCGATCGACACC 

AGGCTGTGGATGCATGAGCCGGGCCGTGCGGACGCAGGACCCCGCTACTCATGGCTCTTCGTG 

GCCGATGAACAGGTCCTCCACCCGTCGATCGTGCTCGGTGCCCCGATCCGTCCAGTCCCACCC 

GCCGGCGACCGCGATGTTTGCACCCGAGACGTACGAGGCGCGGTCGGAGGCGAGGAACGCCAC 

AGCATCTGCGACCTCGCTGGCGCGCCCCAGGCGGCCCATGGGGACGCGCCGCTCCATCCACTC 

CTTCTGCGCGGGCGGAAGGTATCCGTTGTCGATGAGCCCTGGAGACACACAGTTGACCAGGAT 

TCCATGAGGCGCCTCCTCCGTGGCCAGGCTGCGCGTGAGGATGAGCACGCCGGTCTTCGCGAT 

CGAGTACGCCGCCACGTTCGGCGCGCCGCGGATCGCGTACGTGGGGCTCAACCCGATATTGAT 

GATCCGGCCGCTCTTTCGCTGGCGCATGCGCGCCACGGCCGCGCGACAGAGGTAATGAACGCT 

GCTCAGGTTGCTGTCCATGACGTTGCGCCATTCGTCGTCGGTCATCGCCGCAAGCGGCTTGAA 

GAAGAAGTCGCCCACGTTATTGACGAGGATGTCGATGGGGCCCAGCTGCGCCTCGACGCTGGA 

GAAGAGCTCCGCGGCCGCGTTGGGGCGGGTGACGTCGGCCTGCACCACCATGGTTCGTCGCCC 

GAGCGCGCGGATCTCGGCCGCCGTCTGCTCGGCCGCATCCTTGTTCGAATGGTAATTGACGGC 

GACGTCCGCGCCTTGCTCCGCGAGGCGCAGCGCGATCGCCTTGCCAATTCCGCGCGAGCTACC 
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GGTGACCAGGGCGACGCGCCCGGCGAGCTCCAGCGATCGCGCC7GTGGCAGGGCCGGAGCAGC 

CTCCTGGTGGAGCTCGACGTCGACGGGGAGCTCCACGTGGTAGCTCGTCTCTCGCGGAGCCGC 

GCAGTACCTCTCGTAGAACGCCTCGAGGACGGGCTCTTCGCGTCGCATGATGTCCGCGTGGGA 

TTCGGCGCTGCGCCACGGATAGAGCACCAGGATCTCGTCGGGGCGCACGGTGCTCTGAAAGAA 

TCGCGCATGACCGCACCACCCCGGGTGTTCGTGGGCTCCTGGGCCGAGCAGATCATCCATTTT 

TTGCATGATCTGCGTGGCCTCGCCCTCCATGCCGGGTTTGATGCGCCATCGCTCCATAACGAG 

GATCATGTCTTGGCTCCTGTTCGTCATCGCCGTTTCGATCTGGGGGGGCTGCCCGCGCTCTCG 

AGGGCGCGCCCCTTGTATTGGCCGCGGATGGTCTGGGTAGCGCTCGCGAGCTTTCGCTTGTGG 

GCGGCGTTCAGGCTTGCGCCTTGATTGACGAACCGCTCGCAGACGAATGCGTGCGATTCATAT 

GCGGTCGCGAGCGCCCACAGGTAGAGACGGTGGCTACGAGCGAGCTTGGGCGGAAGQGCCCTC 

AAGGCGGGTCGGATGGATTTGAGGGTGCGAATCAAGCGGGCGTGCTCTTCGAAGAAGAGACCG 

CTGAATCCATCCTGGACGAACGGCGGGGCCATGGTAGGTCGCACGATGTTGTTGTAATCATCA 

TTCGAGAAATCGCCGGTAAACCGGAAAGATGCCGCCGTTGCCCACCAGAGCTGCGTGGAGATC 

GCGAGCGCCTCTCGCGCCCTCGGAACGTCACGGTTGGCGAGCGCCTCCGACAGCCACCTGTGC 

GCCATGAGCAGCGCCTGGACCAGCACGAAGAACGCGTGATGCCCCAATACCCAGCGGCCCAGC 

GCGCCGTCCGGAGCGCCCGCCTCCGCCGGAGGCGCAGCCAGCTGCGGGATGCCATTCCATGAT 

TTTGGCCTTCGCTTGCCGGAGAACTGGTGGAGGATGTCCTCGATGGACGCACAGAGATGCCCC 

ATTTCCATGGGTTGCAGGGAAGTACCTTTCAGGCTTTCGCGGATCATTCGGTAATATGCGACG 

ATCACCGCTTCGCAGTACGTCTCGAGCGGGTCTGGGGCGGAGACCCGGTGCACATGGAAATAG 

GCGTCGTACTCCGCCTCCCGGTCCGACAGGCTCCCCGGGATGTCCGGATCTGCCGCTTGCGCC 

TGCCATCGCTCGAGGATGGGGACGGCGAGGGGGGCGAGATGCTCGGCGAGCACCGCCAGGTCG 

CCCTCCGGCGCATGCGCCAACAAGACCTCGAAGGCCTCGGCTGCGACCTTGGAGGTCTCGCCC 

ATTCCGACGGCCTCGATGGCTTGGGGCAGCGGTAGATGGATGGTATATTTAGCCATGATTTGC 

CCGAAGATTGCCGCTGCGTCGACAGATCTTTCGCGAGCCGGAACGCCATTTCCACTGCTCTGG 

CTCTCAATATTGAATTGAGCCCTGGCGACTGCCATAGGCCCAGTCGCTCGACACAGTGTACGG 

AGCGGCCCGATGCTTTCTCCTTTTTTAGTCCTGCACCGAATACTTCTGTTGGGCGCCAAAGAT 

CCCTTGCCGAGACTGTCCGGCGAGATGTCGTGTGCGAAGCGTCCGCACGTCCAGCGGGCCCAT 

GCGTTGCTAGAGCATAAAACGGTTCGATGCCTGGTCGAGAGGGAGACGCGAGGAGCCTCCCTT 

TC-GGACGGATGAGGAATTTCGTGACCGAAATGTCGGCAGGAACAGCGGCGCAGAAGCGGCGCA 

TCGATGGGGAACCATGGGTTACGAAGACATTGATGATAATGTCGACGCAATCGCAATCGTCGC 
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GATGAGCGGCCGCTTCCCCGGCC-CGAGAAACGTCGAGGAGCTGTGGCAGAAGCTCCGCGCTGG 
CGTGGAATGCGTCGTCACCTTCACAC-AGGCCGAGGCGCTCGCCGCGGGGGTGAGCCGCGAGAT 
GCTCGCGAATCCCAGCTACGTGCGCAGAGGCGCGCCGCTCGACGGCGTGGAGCTCTTCGACGC 
CTCGTTCTTCGGGTTCAGCCCGAGG3AGGCAGAGAGCATGGATCCGCAGCAGCGCATCTTCCT 
GGAGGTCGCCTGGGAGGCCCTCGAGCGCGCCGGTTACGACCCCGATGCCCATTCCGGGCCTAT 
CGGCGTCTTCGCGGGCAGCGCCCCGAGCGGCTACCACTCCCTGGCGCAGTCCGACCCGGAGAT 

cctaggcgccctcggccactacca;^.ctgacgctgaacaacgacaaggattatctcaccacaca 

CGCCTCGTACAAGCTCAATCTGCGGGGCCCGAGCGTGTGCGTGCAGACGTCCTGCTCGACCTC 

gctcgtggccgtggtcatggcctgccagagcctgctcaaccacgagtgcgacatggcgctcgc 

GGGTGGCGTGGGGATCCATGCGCATCAGCGGAGGGGCTATCTGTATCAGGAGAACGGCATCTC 

ttcgcccgatgggcattgccgcgccttcgatgtggccgccaagggcaccgtgggcggcagtgg 

CATAGGCATCGTCGTCCTGAAGCGGCTCGCCGACGCGCTCGCCGACGGCGACCACGTGCACGC 

ggtgattcgaggagcggcgatcaacaacgacggctcgagcaagatcggttacacggcgccgag 

CGTGCAGGGGCAGGCCGAGGTGATCGGCATGGCCCAGGCGCTCGCCGGCGTGGAGCCGGATGA 

catcagctacatcgaggcgcacggcacggggacgccgctcggcgatcccatcgagatcgcagc 

CCTCACGCGCGTGTTCCGGGCGAAGACCGCACGAAGGCAGTTCTGCGCCATCGGCTCGCTCAA 

gaccaacctcggccacctcgatgccgccgcgggcgtcgcctcgctgatcaaaacggtcatggc 

CCTCGAGCACCGCGAGCTGCCCCCGAGCCTGCACTTCGAGCGTCCGAATCCGAAGCTCGAGCT 

ggagagcagccctttctacgtcaacacccgcctcactccgtggcacgcggcacgaggtccgcg 

CCGCGCTGGCGTCAGCTCGTTCGGCATCGGCGGCACCAACGCGCACGTGGTCCTCGAAGAAGC 

TCCGGCCCCGCCTCCGAGCGGCCCCrCGCGGCGTTGGCAGCTCCTCACCCTCGCGGCTCGCTC 

CGAGGCCGGGCTCGCGCGGGCCACGGCCGACATGATCGAGCACCTCGATCGCCACTCCGGCAC 

ATCGATCGCCGATGTCACGTACACGAGCCACGTGGGGCGCCGGGCCTGGCCCTTCCGGCGAGC 

GGTCGTCGGCGAGAGCGCCGCGGATCTCCGCGCCGCGCTCGCGAGCGAGGGCTCGCCGCGCTC 

GATCTCGTCATGCCAGGCGGCGAGG3AGAGGCCCGTCGTCTTCCTGTTCCCCGGTCAGGGAGC 

GCAGCACCTCTTCATGGCGCGGGAGCTGTACGAGGTCGAGCCGATCTTCCGGCAGTCCCTCGA * 

CCGCTGCGCCGAGCTCCTGCGCGGCCCGCTCGGCCTCGATCTGCGGCAGGTCCTCTACCCCGC 

CGAGGGGCAGCGCGACGACGCCGAQCAGGAGCTCGGTAGGACCGCGATCGCCCAGCCCGCGCT 

GTTCGCCATCGAGCTCTCGCTCGCCAAGCTGTGGATGGCCTGGGGGATCGTCCCCCAGGCGAT 

GATCGGCCACAGCGTCGGCGAGTTCGCCGCGGCTTGTCTGGCGGGCATC7TCCGCGAAGAGGA 
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CGCGCTCCGCCTCGTCGCCGAGCGGGGCCGCCTGATGCAACAGATGCCGCCCGGCGCGATGCT 

GGCGGTGCGCCTCGCGGAGCCCGAGCTCGCCCCCTACCTCAGCGACGACATCTCGCTCGCGGC 

GATCAACGGTCCGGCTCTCTCGGTGGTCGCTGGGCCGATCGAGGCCATCGACGCGCTCGCGGC 

CGAGCTCTTGGACCACGGGCTCTCGTGCCGGCGACTCCACACGCGGCACGCCTTCCACTCGAA 

GATGATGGCCCCCGTCGTTGACGCCTTTACCCGATGCGTGTCCGCGGTCGAGCGCCGCCCGCC 

GTCAGGCCACTTCCTCTCGACCCTGACGGGCGGCTGGATCTCCCCCGAAGCAGCGACCATCCC 

CGCATACTGGGCCCGGCAGCTCGTGGAGCCGGTGCGCTTCGCCCAGGCCGTGAGGCAGCTGCT 

GTCCGAGTCGACGTGGCTCTGGCTCGAGCTGGGTCCGGGCCAGACCCTGAGCCCGCTCGTACG 

GCAGCAGGCCCGCGCGGATGGCGGCCAGGTGGTCGTCGCCTCGCTGCCGCGCGCGAAGGACGC 

GGGCGCCGACCACCTCGCGGTCATCGAGGCGCTCGGCCGTGTCTGGAGCGCTGGTGGGACGGT 

CGACTGGAAGCGCTTTCACGAGGGCGAGGCGCGGCGGCGGGTGCTGCTACCGACCTACCCCTT 

CGAGCGGCAACGATACTGGGCCTCTCCGCGCCACACGAGCGCTCCGCCGGAAGCGATAATCAA 

GCCGCTCCTCGCGAAGAACCCAAACGTCGCCGATTGGTTCTTCCTCCCTGCCTGGCGGCGCTC 

GGATCCTCCGGTCTCGTTCGACGCGCAGGCGGTGACCACGCGGCGCTCTACGTC-GCTCGTCTT 

CATCGGGGACGAGGGCCTCGGCGCGGCGCTGGTGGAGGGCCTCGCGCGGCGGGGGCACGAGGT 

CGTCGCGGTGGTCACGGGTGAGAGGTTCGAGCAGACGGGCACGCAGCGCTACACGATCGATCC 

CGCCGCGAATGGCGATGTTGCGTCCCTCTTCGCGCGGCTCGAAATCGAAGGGCGCATGCCGGA 

CCGGATCGTCCATGCCTTCTGCACGTCGCCTGCGGACGGCGCGCGCATCGAGCGCGGAGCCGC 

GCTGGAGATCGAGCGCAGGCTGGGCTTCGATAGCCTCCTCCTCCTCGCCCAGGTGATCGCCGC 

ACAAAGGCATCCGAAGCCGCTGA7GCTCGGCGTGATCACGACCCGGGCGCACTCCGTCATCGG 

AACC3AGATCATCGAGCCCCTGCGCGCTCTGGTGCTCGGCCCCTGCCGCGTCATCCCGCAAGA 

AATACCCCATGTCTCGTGCCGGAACATCGATATCGATCTCCCGGGCGAAGGCGGGCGCGCGGA 

GATCGCGGCGCGCCTGATCGCCGATCTGGAGCGAGAGTCGCCCGACTCGGTGGTGGCCTACCG 

CGGCGGCCGGCGCTGGGTCGAGAGCATAGAGCTCACCGATGTCGGCCGGCGGTCAGCTGGCGC 

CGCCCCGCGCCTCCGCCAGCGCGGGGCGTACCTCATTACCGGCGGCCTGGGGGGCATCGGCCT 

CGTGGCTGCAGAGCTCTTGGCCCGAGAGGCGCACGCACGGCTGATCCTGGTTGGGCGGACAGG 

CCTC-CCAGCGCGGCAGGGGTGGGACGACTGGCTCGCGGCGCACGGCGCGGGCGACGCGACGAG 

CCGAAAGATCCTCCGGATCCGCGCGCTCGAGGAGGCCGGCGCCGAGGTGAAGATCGCCGCGGC 

CGAC3TCTCCGATTTCAATGCGATGCGGAGCGTCATCGAGGAGGCCCGGACGCGCTTCGGCCG 

CATCGACGGCGTCATTCACTCCGCGGGCATCGCGAGTGGAGGCATGATCCAGCTCAGGACGCC 
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GATGGCGGCTTGGCGCGTGATGGCGCCGAAGGTCGGCGGCACGCTCGTGCTCGATGCGCTCCT 

CCGGGACGAGCGTCCCGACTTCCTCCTGATCTGCTCGTCGTTGGCCTCGCTGGTCGGCGGCGC 

CACCCAGATCGATTACTGCGCCGCCAACGCCTTCCTCGACGCCTACGCGCAGAGCCGCGAGGG 

CGAGGAGGGATGCCGCGTCATCTCGGTGCAATGGGACACGTGGAGTGACGTCGGGATGGCGGT 

GGACTTCAAGCTCCCGGCCGATCTCCAAGAGGGGCGCCGCGAGAGCCTGAAGCGGGGCATCAG 

CTCGAGCGAGGGCGCCGAGGTGCTCGGCCGCATCTTGAGCGCAGGCATGAGCGGCCCGCTGGC 

GATTTGCACGTCGGATCTACCAGCGTACAAGCAGTCTGTCACGACACGCCGATCGCAGCACGA 

GCAAACTCCCGCCGCCCGGCCGATGCACTCGCGCCCAACGACCACGGGAGCCTATGTCGCTCC 

CGAGACCGAGACCGAACGGCGCATCGCCGCGATCTGGCAGGATCTCCTCGGCCTCGAGCAGGT 

AGGCGCAAACGACGATTTCCTCCAGCTGGGCGGCCATTCGCTGTTGGCCACGCAGGTCCTGTC 

TCGCGTCCTGCAGACCCTCAAGGTGGGGATCTCGTTGCCGCAGTTCTTCGATGCGCCGACGGT 

CGCAGGGCTTTCGCGCCTGGTCGACGCAGCACGGGCCGAAGGCGCCGGACCCGTCGCGCCGGC 

AATCGGCCGTGTCGAGCGAGACGCCTACCGAATCAAGCCGCCCGCGGCCGAACAGGCCGCCCG 

CACCAAGCCGTAACAAGAAGGGGATCGAGTCATGGAACCCGTCGGCGGCGTGGACATGAATCA 

GCCCGCAAAGCAGCAGGAGACCTGCGTCTTCCCGACCTCCTTCGCGCAGCGGCGGCTCTGGTT 

CCTCGACCAGCTCGAGCCGGGGAGCGCCGTCTACAACATGCCCGCCTCCTTCCGGACGCGCGG 

GCCGTACGACGTCGACTCGCTCGTGCGCAGCGTGAACGAGATCGTGCGGCGCCACGAGTCGCT 

GCGCACGACCGTCGATGTCATCGATGGCGAACCCGTGCAGGTGATCGCCCCCTCGCTGCGCAT 

CGAGGTGCCCGTCGTGGACCTGAGCGAGATCGACGAGCCGGAGCGAGAGGCGGAGGCCCGGCG 

GCTCATGGCGGAGGAGAGCCGCCGCCCCTTCGATCTCACGCGAGGGCCGCTGCTCCGAGCCAA 

GCTGCTCCGGCTCGGCGAGGCCGATCACGTGCTGATCTTGACGATGCATCATATCGTCTCCGA 

CGGCTGGTCGATGGACGTGCTGTTCAAGGAGCTTTCCACGCTCTACGCCGCCTTCCACGAGGG 

CCGCCCGTCGCCGCTCCCGGAGCTGCCGATTCAATACGCCGACTTCGCGGTGTGGCAGCGGGA 

GCTGCTCCAGGGCGAAGTTCTGGAATCGCACCTCGGGTACTGGAGAGAGCACCTCCGCGGCGC 

CCCCACGCTGCTGGAGCTTCCGATGGACCGGCCCCGGCCGCCGGCGCAGAeGTTCCGGGGCTC 

CCAGCGCGCGTTCCGACTCCCACTCTCCCTGCAACAGGCGGTGCAGGCGCTCAGCCGGCAGGA 

AGGCGCGACCCCCTTCATGACGCTGCTGACGGCGTTCAGCGTGCTGCTCTCGCGTTATGCGCG 

GCAGAGCGATCTGGTGGTTGGCACGCCCATCGCGAATCGCACCCGAGCAGAGCTGGAGGGGCT 

GATCGGCTTCTTCGTCAACATGCTGGCGCTGCGCATCGACCTCGGGGGCGACCCGAGCTTCCG 

CGAGCTGCTCGGGCGGGTGCGGGAGGTGACGTTGGGCGCCTACGCGCACCAGGACCTGCCCTT 
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CGAACGGCTGGTGGAGGAGCTGTCACCAGGGCGGAGCCCCAGCCACAGCCCCTTGTTCCAGGT 

GTCCTTCACGTTGCAGAACACCCCGATGGATGCGACGAACAGAGCAGACATTGCATCGGGTGG 

CGCGCCGCTGGTGGAAATGAAGGCGGCGAAATTCGATCTGATCCTGGAGCTCTCGGAATCGCC 

GCAAGGGTTGCTCGGCACGTTCGAGTACAACACCGACCTGTTCGACGCCGGCACCATCGAGCG 

GATGGCCGGCCACCTGGAGGTGCTGCTCTCCAGCGCCGTCGCGGCGCCGGATCGACCCATTGC 

GGAGCTGCCGCTCATGGGGGCCGAGGAGCGCAGTCGGGTATTGGTGGAGTGGAACTCCACTGC 

CGCGCTGTATCCCGAGGACCATTGCATGCACGAGCTGTTCGAGCAGCAAGTGGAGCGGTCGCC 

CGAGGCGACCGCGGTGCTCCTCCAGCAGCAGACGTTGACGTATCGAGAGCTGAACATGCGCGC 

CAATCAGCTCGCGCATCACCTGCGGAGCCTGGGCGTGGGCCCAGAGGTGCGCGTCGGGTTGTA 

TCTCGAACGGTCAATCGAGACGGTCGTGGCGATCCTCGGCGTGCTCAAGGCTGGCGGGGCCTA 

CGTGCCGCTCGATCCGACGTACCCCAGCGAGCGCCTCGGGCTCATGATGGCGGACGCAGCGCC 

CTCGGTGCTGCTCACGCAGGCGTCGCTCCTCTCGAAGCTGCCGCCCCACGGGGATGCAACGCT 

GGTACAGCTCGACGCGCTGCACGAAGCGCTCTCCAGGCTGCCACACCATACCCCGCGGAGCGG 

CGTCACCGCCCAGAACCTCGCATACGTCATGTACACTTCCGGCTCGACCGGGCGGCCCAAGGG 

CGTGCTCGTCGAGCACCGCGGCCTCTGCAACCTGCCCACCGTGCAGGCCAAGCTCTATGGAAT 

CGCGCCGGGCGACAGGCTCCTCCAGTTCGCGCCGCTCTGCTTCGACACATCGTTCTGCGAGAT 

CGCGCTCGCGTTGCTCTCGGGAGCGACGCTGGTCATGGGCACGGCGGACGAGCTTCTCCCGGG 

ACCTCCGCTGGTCGAGCTGCTGAAGAAGCACGCGGTCACGGCGATGCTCCTGGCCCCTACCGT 

GCTCGCAGCGCTGCCAGAACAACAGAGCGCGGCGTTGCCGCTGCGCGTGCTCACGATGGCCGG 

TGAGGCGTGCCCGGCGGAGCTCGTCAAGCGCTGGAAGGCACCCGGACGGCGCCTGTTCAACTC 

CTATGGCCCGACCGAGACGACCATTTGGGCAAGCTCCGCAGCGGACCTGTCCGACGAACGGAT 

CCCGCCCATCGGCCGTCCGATTGCCAATACGCAAATCTACGTGCTCGACGAAGCGCTCGAGCC 

GGTGCCCATCGGCGTGCCGGGCGAGATCTTCATCGGCGGCGTGC-GCGTCGCCCGGGGATATCA 

CGGGCGTCCGGACCTGACGGCCGAGCGATTCGTACCCGACCCCTTCGGGCAAACCAAAGGGGC 

GCGCCTGTATCGGACCGGCGATCGGGCGCGCTGGCTGCCGGACGGAAACCTCGAGTTTCTCGG 

TCGAAACGACGAGCAGGTGAAGGTCCGCGGTGTCCGCATCGAGCTGGAGGAGATCCGCGCGGC 

GTTGCTCAAGCACCCGGCGGTCGCTCAAGCCGTGGCCGTGGTGCGCGAGGACACGCCGGGGGA 

CAAGCGGCTCGTCGCGTATGTCGTCGGACGCGGAGGAGCGCGCGTGACCGCCGCGGAGCTGCG 

CCAGTCCGTGAGCGAGCGATTGCCTGCGACCATGGTGCCATCGTCCTTCGTGGCGCTCGACGC 

CTTGCCCCTGACGCCGAATGGCAAGGTGGACCGCCGCGCGCTGCCGGAGCCCGAGCAGAGCGC 
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CGGCGGCGAGGACCACGTCGCGCCC-CGCAACGCCGTCGAGGAGGAGCTCGCCAGGATCTGGGC 

GAGCGTCCTCCGGCTCGAAAGGGTCGGCGTCCACGACAACTTCTTCGAGATCGGCGGCGACTC 

GATCCTGAGCATCCAGATCGTGGT3CGCGCGCAGCAGGCAGGGCTGCGCCTCACCCCGCGTCA 

GATGTTCCAGCACCAGACCATCGCCGAGCTTTCGACCGTGGCTAGAGCCGTCGAGGCGGTCCA 

CGTCGAGCAGGACCCGGTGACCGGTCCCGCGCCGCTCACGCCGGTGCAGCGCTGGTGGCTGGA 

GCAGGAGGCGGCCGAGCCGCACCACTTCAACCAGTCGATCTTCCTCGAGGTACGCGAGCGGCT 

CGACGAGAGCGCGCTGGAGCAGGCCATCGCGCATCTGATCGACCACCACGACGCGCTCCGGTT 

GCGCCTCGCGCGCGACGAACGCGGCGCCCACCAGGTCTTCGCCGCGCCGGGAGGCTCGACCCC 

ATTTCAGCGCGTCGACCTCGGGGCGCTGCCCAGCGCCGAGCAGATCTCCGCCATGGAGAAGGC 

CGCGAGCGAGGCGCAGGCGAGCCTCGATCTGGCCGCGGGCCCGGTCGTCCGCGCCGTGCTCTT 

CGACCTCGGCGAGGTCGCCCCGCAACGGCTGCTCGTCATCGCCCACCATATTGCGGTCGACAG 

CGTCTCCTGGCGGATCCTGCTCGACGATCTCTTTGGGGCCTATGAGCAGGCGCGCCGCGGCGA 

GGCCGTACGCCTGCCGCCCAAGACCACGTCGGTCAAGCGCTGGGCCGAGCTGCTGACCGAGCA 

CGCCGGCTCCGAGGCCGTCAAGGCGGAGCTCGGCTACTGGCTCGACTCATCGCGACGAACGGT 

AGCTCCGCTGCCCGTGGATCGACGGGCCGGCGAGGACGTGTGGGGCTCGGCGCGCCACATCGT 

CGTCTCGCTCACGCCGGAGCAGACGGAGCAGCTCCTGCGCGAGGTGCCGCAGGCGTACCGCAC 

ACGGATCGACGACGCGCTCCTCACTGCGTTCGCGCAGGCCATCGCTCGGTGGACGGGCTCGCC 

GGCGGTGCTCCTCGACCTCGAGGGTCACGGGCGCGAGGAGCTCGCCGGCGTAGACCTCACGCG 

CACGGTCGGCTGGTTTACGGCCATGTACCCGATCCTACTCCGCGTCGACGCGGCGGATCCGGG 

TGAGGCGCTCAAATCGATCAAGGAGCAGCTCCGCGCCGTGCCAGGCCGCGGGCTCGGCTACGG 

CTTGTTGCGTTACCTTCGGTCCGA7ACCATCGCCGAGGTCCGCGCGTTGCCGCAGGCCGAGCT 

CTGCTTCAACTACCTCGGCCAGCTCGATCAGGCGATCCCCGAGGCTGCACCGTTCCGGCCGGC 

GCGCGAGTATCAAGGCTCGGAGCGCAGCCCCGGCGCCCATCGCGCCCACCTCATCGAGGTGAA 

CGCGAGCATCGCCAATGGGCGCCTGTACGCCACGTGGACGTACAGCGAGCGCCGCCACGAGCC 

CGAAACCATCGAGCGCGTCGCGGCGAGCTTCGTCACGGCGCTCCGCGCGCTCATCGCGCACTG 

CACCTTGCCCGAGGTCGGCGGCAACACGCCTTCCGACTTCGACAAGGTGCGCCTGCGCCAGGA 

GACCATCGATGCTCTCGACGCAATCGACGCGGGCCCCGGGCCGTCTGCGAGGGGGAGCCGAAT 

CGAAGACGTCTACCCGCTCTCGCCGCTCCAGGAGGGCATCCTGTTCCACACGCTCTACGCCAC 

CGATTACACGGCGTATGTCGAGCAGTTCCACTGGACGCTGGAGGGCGATTTCGACGCCGAGGC 

GTTCACCCGCGCCCTCCAGGACGTCGTCGCTCGGCATGCCGCCCTGCGCACGTCGTTCGCCTG 
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GGAGCGCCTCGATGCTCCACTTCAGATCGTCCGCACGGGCGCGGTCCTCCCCGTCGAGCACCA 

GGACCTACGCGGCCTCGCCGCGGAGGAGCAGACCGCGCACATCTCCCGTTACGTCGAGGCAGA 

GCGCCAGCGCCGGTTCGATCTGCGAAAGGCGCCCCTCATGCGCGCCGGGCTGCTCCGGCTCCG 

CAAGGACGCCTGGTGCCTCGTCGAGACCATCCACCACCTGATCCTGGACGGCTGGTCGACACA 

AATCTTGCTCAAAGAAGTGTTCACGCTCTACGAGGCGCACCGCGGACACCGTGGGCATCTCGC 

GCTGGAGCTCGAGCAGCCGCGGCCCTACGGCGATTACATCGGCTGGCTCGCGAAGCAGGACCA 

GGTGCGCACCGCGGCCTTCTGGCGGCGCGAGCTCGAGGGCTTCTCCGCGCCGACGCCGCTCGG 

CGTCGACCGCGCTGTGCCGCACGACGACGGCGGCCCGCGGTTTGGTTGGCGCCGCATCGCCCT 

CTCGGGCGACGACGCGGCCCGGCTCGCCGCCTTCGCGCGTCAGCATCAGCTCACGATGAGCAC 

GCTGGTGCAAGGCGCGTGGGCGCTGCTCTTGTCACGCTACAGCGGCGATCCCGACGTGCTCTT 

CGGTATGACCGTCTCGGGCCGCTCGGCGCCGATTCCCGGTATCGAGCGCATGACCGGCCTCTT 

CATCAACACCATTCCGGTGCGCGTGCGCGAGCCTGCCGACGCGTCGGTGCTCGCGTGGCTCAA 

GGCGCTCCAGGAGCACGAGGCAGAGCTGCTCGAGCACGAGCACAGCCCGCTGGTCGAGGTCCA 

GGCCCATAGCGACGTGCCGCGCGGGACCCCGCTCTTCGAGAGCCTCGTCGTGTTCGAGAACTA 

CCCGGTGCAGGTCATCTTCGAGGCCCCTCCGGTCGAGGGGCCGACGCGCGCGGAGGAGGGCCT 

CCGCATGATCGATGCGCAGTATATCAGTGATCCACCGTATCCGCTGACGGTCGTCGCGGCCTT 

CCATGGGACGCTTTATCTCAATATTGGCTACGAGCGCCGCCGGTTCGACGACCAGGCCGTCGA 

ACGGATGATCGGGCACGTCACGACGCTGCTCCGGGGCTTCGTGCAGAGGCCCGAGACGTCGGT 

CCGCGATCTGCCGTTGCTGACGGCCGAGGAGGAGCGCACCCAGCTCCACGCGTGGAATGCCAC 

GGCCGCGCCGTATCCCGAGGGCCATTGCATGCACGAGCTGTTCGAGCAGCAAGTGGAGCGGTC 

GCCCGAGGCGACCGCGGTGCTCCTCCAGCAGCAGACGTTGACGTATCGAGAGCTGAACATACG 

CGCCAATCAGCTCGCGCATCACCTGCGGAGCCTCGGCGTGGGCCCAGAAGTGCGCGTGGGCTT 

GTG7CTCGAACGGTCGATCGAGACGGTCGTGGCGATCCTCGGCGTGCTCAAGGCAGGCGGGGT 

CTACGTGCCGCTCGACCCGACGTACCCCAGCGAGCGCCTCGGGCTCATGATGGAGGACGCGGC 

GCCCTCGGTGCTGCTCACGCAGACGTCGCTCCTCTCGAAGCTGCCGCCCCACGGGGATGCAAC 

GCTC-3TACAGCTCGACGCGCTGCACGAAGCGCTCTCCAGGCTGCCACACCATACCCCGCGGAG 

CGGrGTCACGGCCCAGAACCTCGCATACGTCATGTACACTTCCGGCTCGACCGGGCGGCCCAA 

GGGCGTGCTCGTCGAGCACCGCGGCCTGTGCAATCTGCCCACCGTGCAGGCCAAGCTCTATGC 

AATCGCGCCGAGCGACCGGCTCCTCCAGTTCGCGCCGCTCTGCTTCGACACATCGTTCTGCGA 

GATCGCGCTCGCGTTGCTCTCGGGAGCGACGCTGGTGATGGGCACGGCGGACGAGCTCCTCCC 
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GGGACCTCCGCTGGTCGAGCTGC7GAAAAAGCACGCGGTCACGGCGATGCTCCTGGCCCCrrC 

GGTGCTCGCAGCGCTGCCAGAACAACAGAGCGCGGCGTTGCCGCTGCGCGTGCTCGCGATGGC 

CGGCGAGGCGTGCCCGGCGGAGCTCGTCAAGCGCTGGAAGGCACCCGGACGGCGCCTGTTCAA 

CTCCTATGGCCCGACCGAGACCACCATTTGGGCAAGCTCCGCAGCGGACCTGTCCGACGAACG 

GATCCCGCCCATCGGCCGTCCGATTGCCAATACGCAAATCTACGTGCTCGACGAAGCGCTCGA 

GCCGGTGCCCATCGGCGTGCCGGGCGAGATCTTCATCGGCGGCGTGGGCGTCGCCCGGGGATA 

TCACGGGCGGCCGGACCTGACGGCCGAGCGATTCGTACCCGACCCCTTCGGGCAAACCAAAGG 

GGCGCGCCTGTATCGGACCGGCGATCGGGCGCGCTGGCTGCCGGACGGCAACCTCGAGTTTCT 

CGGTCGAAACGACGAGCAGGTGAAGGTCCGCGGTATCCGCATCGAGCTGGAGGAGATCCGCGC 

GGCGTTGCTGAAGCACCCGGCGGTCGCTCAAGCCGTGGCCGTGGTGCGCGAGGACGCGCCGGG 

GGACAAGCGGCTCGTCGCGTATGTCGTCGGACGCGGAGGAGCGCGCCTGACCGCCGCGGAGCT 

GCGCCAGTCCGTGAGCGAGCGATTGCCCGCGACCATGGTGCCGTCGTCCTTCGTGGCGCTCGA 

CGCCCTGCCCCTCACGCCGAACGGCAAGGTGGACCGCCGCGCGCTGCCGGAGCCCGAGCGGAG 

CGCCGGCGGCGAGGACCACGTCGCACCGCGCAACGCCATCGAGGAGGAGCTCACACGAATCTG 

GGCCGACGTACTTGGGGCAAAGCGGGTCGGTGTGCACGACAATTTCTTCGATCTCGGCGGCCA 

TTCCCTGCTGCTCGTCCGGGTGCATGATCGGCTCGGCCAGCGGTTCGATCGGCCGCCCTCGAT 

GGTCGACCTCTTCACCTATCCGACCGTGGCGTCGCTCGCGCGGTTCCTTGGCGAACGGGCGAA 

CGGCAAGCAATCGCCGAGGGAGGCCGCGGCGGACGTCACGGAGCGCGGCCGGCGCCGCCTGGA 

GGCGCGGGCGCGGCGGGCGAAGGCCATCCGTGGCCCGACCTGACCCGGGCACCCTTCCAAGCC 

CCGCCGTTCCTCGCACATCCGCCGCCTCGAGCGCCGCGTCCAGCGCCGCCGTTCGCCGACGAG 

GAGGCGCGAGACGACGGTCCAAGGCCTTCGTGGGCTCTTTGCCCCGCAATCCGGAAGCTGCGC 

GGCAGTTCGTCGCCCCTGCAATGCTGCCATTGTAGAGCTCCTCCGCTCGCCGCGGCCTCTTTT 

CTTGCGGCCCGTCCGCGATTGACCTCACATCCTGATCCCTTCTTGCGTCGTCCAGAAAGTGAT 

TGACGGCCAGCGCCGCGCTTGAGATCTTCCGGCGCGCGGCGATTTCATCGCTCCGGCGCGCCG 

TGACTGTCACCTGCGAAGGGATTATAATGAAACATAACATTGGGTGGCTTCTACCCGCCGCCC 

TCGCGACGCTTGCCTTCGTCCCGGCCTGCAGCCCGAATCACGGTGAGGATGCGCCCTCCGTGA 

CGTCAGCAGAGAGCGGCGCGGCGCCGAGCGCTGACTGCGTCGCGCTCGGGGCGAAGCTCCAGG 

CGGCGCTGGACGGCGCCGCCGCCGCGCAAAAGGCTCCGGGAGCCGCAGCGGCGGTCCAGAGCG 

GGGACTGTGTCTGGCGGGGCGCCACGGGCGTCTCGGACCTGGTCGCGAGCACGCCGACGAAGC 

CTGC-AGATCTCTTTCGGATCGGCAGCATCACCAAGACCTTCGTCTCTACGCTGATACTCATGC 
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TCCGGGCAGAAGGCCGGTTGTCGCTCGACGACGCGGTGTCGAAGTATGTGAAGGGCATCCCCG 

CCGGCGACCAGATGACGCTGCGCCAGATCCTCGGTCACACGAGCGGGCTCTTCGATTACACGT 

ACAGCCCGGCGCTCGGCCAAATGATCGAGGTGGATCCGACCCGCGCCTTCGCGCCGGCAGAGC 

TCATCGCCCTCGCCACGGCCGAGGCGCCGTATTTCGCGCCGGGCGCGGGTTTTCGCTATTCGA 

ACACCAATTACATCGTGGCCGGCCTGGTGGCCGAGGCGGTGTCGGGCGGGACGCTCGCCGGGC 

TGCTCCGCACGCGCATCCTAGACCCTGTGGGCCTCGCGCACACGTATCTC-GACGGCGCCGAGC 

CGCCGGTCCAAGGGCTCATCCGCGGCTACGGCGACTACGGCGCGGGCTTGGTCGACATCACCG 

ACCAGCTGTCGCCCACCGAGGCGTGGGCCGCCGGCGCCCTGGTGTCGAACGTCGATGACCTCA 

ATCGCTTCTTTGCCCTGCTCATCAGCCACGAGCTGCTCTCGTCGGACGAGCTTCAGGACATGA 

CCACCTGGACCCCGACGATGTGGCCCCACGAGCCCGGATATGGCCTCGGCCTCATCGAGCGCG 

ATTCTGCGCTCGGCTCCCTCAACGGGCACTGCGGAATCATCTGGGGCTTTCAATCGGCGTCGT 

ACGGGGTGCCCGGCCGCGGCGACGCGATCACCGCGCTCATCAACCGGAGCGACGGCGACGCAG 

CGCGGCTCGTCGACGAGCTCGCGAAGGTCGTGAAAGAGCGCTGATCGAGGCGGAATGGGAGCG 

CTTCGGCGGGTGGTGATGGCGCCCGGCGCTCAGAACGCGACGCGCAGCCCCGCGCTCAGCGGG 

CCTGCGCCGGGCXSACGCGGCCACGGCGCCCGGACCGACGAGGAGCCGCGCGACGGCGGGCGCG 

CTCGGCGCGTCGTCTCGCCGCACCCGCCGCTTGCCGAACACGTAGAGCGGCAGGCCGACGGCG 

ACCCCGGCCACCCCGCCGAGCGCGGTGGCGATCGCCACCTCGGACGCCTCGGCGCGCGCGGCG 

CTGCTGTCGTCGTGGCTCGCGAAGACCAGCACCGCGCCGCTGAGGATGGCGGCGCCGCCCAGG 

GTCGTGAGGACGAGCCCCGAGATCACCATGACCGGGCTGTTCCACTCCGTCGTCCGCTCCTCG 

AAGTCGCGGAACGCCGCCCTCGCCGCCGCGAGCTCCAGCTCGATCCGGCGCTGCTCGGCGCGC 

CGCTCGTCCGCGGAGCCGATCTCGTGGACGCGGCGGGCCTGGCCCTCCAGCGCCGCGATGCGC 

GCCTCGTGCGCGGCGGCGGTCTCCTCCCACGTGGCCCCTGGCGGGACCCCCGCCACGGCCGGC 

GCGACAGAGGGCGCCGACGCCGGGGTCGAGGCGGGCGCCGCGGGCGGCTCCGCGGCCACGGAA 

GGCGCCGCCGCCGCGGGAGGCGCGGGCGGCTCCGCGGCCACGGAAGGCGCCGCCGCCGCGGAA 

GGCGCGGGCGACTCCGCGGCCACGGAAGGCGCGGCCGCCGCGGGAGGCGCGGGCGGCTCCGCT 

GCAACGGAGAGCGCGGGCGCCGCGGCAGCCAGGCCCAGCGCCCACGCGACGACACGACGGCGC 

GCCGCAACCGCGCGCGGGCGCGCGAAGCGGAGGTGGACCTGCTCCATGCGCGCAGCGTCGCCC 

CTCGACAGGGCCGGGTCAAGGCGCGGGAGTCCGAGAGCACGAGACCTCCGCGCCGCAGGAAAC 

AGGCGCGCCGGCGGCCCGCGCGGCGGCTCGCCGCTCACCCCTCGCGCGGCCGGCCGCGGCGCC 

GCCTCCCCTCCCCGGCGGGCCGCGCGTCC<5CGGCCACGCGGAGCAGCTCCTGGAAGTGCCGCT 
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CCACCGGGCCGAGGTCGATGCCGTCCATGAACGACGTGAACGCGAAGTAC3GCAGCAGCGTCT 

GCCAGGCGCGCAGCCAGCCCGGGAG3TAGCGCGGCCGCTCCAGCACGCCGGCGGCGCGCAGCT 

CCGCCAGCACCGCGATGTCGTCGAGCGCGATCCGGCCGCACACGAGCAGCTCCACCTCGTCGC 

GCAGCCCGCCGCGAAGGACCGCGGCCAGGAAGGCGTAGACCTCGAGCAGCCCGGCCAGGTAGC 

AGGCGTCCTTGGTGAACGGCGCGCCGCCCTCGACGAGCCCGCCGCGGCACACGCGCTGGGCGT 

CGAAGTAGGCGTCGCGGCGCTCGGCGCCGCGCTCGCGCAGGTGCCGGTACAGGTCGAGGAAGC 

TCGCGCCCTGCTCGGCCATGTCCACGAGCCGCACCCGCTCGGCGAGCCGGGTGAGGCGGCCGA 

TGGAGAGCGAGCGGCTGTAGAGCTCGGCGAAGATGGCGAGCCCCTCCTGCGTGCGCGTGGTGC 

GGGGGCCGCCCGAGCGCAGGAACGCGCACCGCGGCTGCGCCGCGCCGTTGTGCGCGGTGAGCG 

CGTGCGTCTCGACCTCGTGGTGCCACAGCCCCTCGGCCTCCCACGCCGCGAAGGTCGCCTCCG 

GCCGGATGCGGACCCGGCTCATGCCGCCGACCACCTTGGCCGTGACGCGCGGGTCGACGGTGA 

TCTCGAGGTCGAGCCGCGGCGCCCG3CCGGCCACGCGGGCGGCGAGCATGTCCCGGAGCGCGC 

CGGCGTCGAGCGGCTCCTCCTCGGGATCGCTGGCCTCGTCCCAGCCGTGGACGCGCAGGCGCT 

CGGTGAGGTGCTCGGCGAGGTCGATGTTCCTGAGCGAGCCGCCGAAGAACCGCGAGCGCGCGC 

CGCCGTAGAGCTCCTGCGACCGCGCGGAGAACGCGCGGGTGCCCGCGGCCTCGAGCAGCTCCG 

CGGCCTGGATCTGCGCGCGGACGTT3TCCCGCAGCCAGCCGAGCGCCGGCGCGTCCCCGTCGA 

TGGCCCCGAGGAGCTCGCGCAGCTCGGCGACGCGCCGCGCGAGGCCGTCGCGATCGACGCGGT 

ACTCGACCTCGGGGAGGCGGTCCTCGCCGGCGGCGAAGAAGCGCTCCTCCACCTCGCGCGGCC 

AGGCGATGTCCTCGAGCAGCTTGAGGGCCTTGCCCTCCGCCAGGCGGCCGCCCACCCGATCGA 

GCTGCTCCAGCACGGCGCGGTCGAT3CTCATCGAGCGCAGGATCGCCGAAACCGCGAGACGCC 

GGAACCGTCATTCCCTCGACGAGGCAGCGATTGCCATGTTCCGTCGCTTTTTGGAGCGCCGTC 

GTCGCGCTCGCCTGCGGGCTCCGGCGATCCAGCGCGGTTGCATGCAGCGAGGGTGTTCCGGGG 

CTGGCTCGAGAGCGTCCTTTGGCCCACACCCGAGACACGAATGCTCCGCGCCGAGCGCGGTTG 

ACCGTGGACCCGCCGGAGAGCCGATGATACGGTCCGGCCGATGTCGGAGAGTGTAGCTCAACT 

CGAAGAACACCGCGCGGCGCTCACCGGACACTGCTACCGGATGCTGGGTTCGGTGGTCGACGC 

CGACGACGCCGTCCAGGAGACGATG3TGCGCGCCTGGCGGAGCCTGGATAAGTTCGACGGGCG 

CTCGTCGCTGCGCACCTGGCTGTACCGCATCGCGACGAACGTCTGCATCGACCTGCGGGCCGA 

CCGCGCGCGCCGGGCGCGCCCCATCGAGGAAGGCCCGGTCGGCACGGTGGACGACGCGCTCGA 

GACGCGCCCGCGCACCCACTGGCTGGAGCCCGTCCCCGACGCGCACGCCCTGCCGGCGGACAT 

CGACGCCGC.GGAGCGGGCGATGCTCGGCCAGAGCATCCGCCTCGCGTTCGTCGCGGCGCTCCA 
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GCACCTGCCGCCGAAGCAGCGCGCCGCGCTGCTGCTCACGGAGGTGCTCGGCTGGTCCGCCGC 

GGAGGTCGCCGACAGCCTCAACACCTCGGTCGCCGCGATCAACAGCGCGCTCCAGCGCGCGCG 

GGCGACGCTCGCGAGCCGCGATCTCGGCGACGCGCGCCCCTCGCTGCCGGAGCCGCAGTCCGC 

GCTGCTCGACCGCTACGTCAACGCCTTCGAGCGGTACGACGTCGACGCGCTCACGGCGCTGCT 

GCAGCAGGACGCGACCCTGTCGATGCCGCCGTTCACCCTGTGGCTCCGCGGCCACGAGTCGAT 

CCGCGCCTGGCTCGTGGGCCCGGGAGCGGGCTGCCGCGGGTCGCGGCTCATCCCGACGGCGGC 

GAGCGGCTCGCCCGCGTTCGCGCAGTATCGCCCGGCGCCC<3AGGGCGGCCACCGGGGCTGGGC 

GCTCATCGTCCTCGACGTCGCGGGGGACCGCATCGTCAGCATGACGTCCTTCCTCGACACCGA 

GACGCTCTTCCCGCGGTTCGGCCTGCCGCTCGATCTACCGGCGTAGCCGCGGGCGCCCTGCCT 

GCCTCGCCGCGGGTGCCCTGCCTGCCTAGCCGCGGGCGCCCGGCCTGGCCACGGGCGCCCGGC 

CTGGCCACGGGCGCCCGGCCAGCGACGGGGCGACGATTTTTTTCTGAGCGACCGATGAGTCCT 

GACGGGGCCGGGGGTCTACGGGGGTGAATCCAACACGGAGGCACCCATGACCGTGACCATCGC 

CAGCATCGATCATCGTGACCAGGACCTCATGACCGGGCCCCAGGCCAAGGCGCCGGCCCGCGC 

GGCGGCGCCCGACGCGGCGCCGTCCAGGCGAGCCGTGTGGGCGGGCCGCGTCCTGAGCGGGCT 

GGCCACGCTGTTCCTGACGTTCGACGCCGCGGTGAAGGTGCTGAAGCTGTTCCCCGCGGAGGC 

GTCGACCGCCGAGCTCGGGTTCCCGGCGCACCTCGTCCCCACCCTCGGCTACCTCCAGATCGC 

TTGCCTCGTGGCCTACCTGATCCCGCGCACCGCGGTGCTCGGCGCGATCCTGTGGACCGGCTA 

CCTGGGCGGCGCGATCGCGATCCACGTGCGGGTCGAGAACCCGCTCTTCAGCCACACGCTCTT 

CCCCATCTACGTCGCCGCGTTCCTCTGGGCGGGGCTCTGGCTGCGCGACCGCCGCGTGCGCGC 

GCTGACCGCGAGCCCGTCGTCGCAGGGCCGATGAGCTTCACGTTTCACGAGAGTCCATCACGG 

TAAAAGGAGAAGCGAGCCATGACCACAAAGAACCCCCGCAAGCTCTTCGTCAACCTGTCCGTC 

CGCGACCTGAAGCGATCGATGGAGTTCTTCAGCAAGCTCGGGTTCGAGTTCAACCCGCAGTTC 

ACGGACGAGAAGGCCGCCTGCATGGTCGTCAGCGAGGAGGCCTATGTCATGCTCCTCGTGGAG 

TCGTTCTTCAAGACGTTCATGAAGAAGGAGATCTGCAGCACGAGCACGCACACGGAAGGGCTC 

TTC3CGCTCTCGTGCAGCAGCCGGGCCGAGGTCGACGACATGGTGAAGAAGGGGGTCGCGGCG 

GGCGGGTCGCACGCGATGGATCCGCAGGATCACGGCTTCATGTACGGGTGGAGCTTCTACGAC 

GTG3ATGGCCACCACTGGGAGGTCATGTGGATGGATCCCAAGGCGATCCAGCCGTAGCCGACG 

GGGCTGGGCGCGCCGCCTGGAAGAGCCCCCGTGAGGCGGGGAGGCGGGAGGATCACCGTCTTC 

GTA3CCCACAGCGATGCAGTATCCGTCGCGCTTCGTATCGAAGCACGGCTGTTACGGGCGCGT 

CAGAGCGCCTCGCAGGTGATGCCGAGCCGCAGCAGCGACACGGGCACGAGCGTGGCTCCGATG 
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GAGATGAGCCGAGTCTCGCCCATGGTCTCGGGGTCATGAATGGATGAGTAGGGGACTCGCTCC 

TTCGTCACGTCGTGCTCGACGGCGACGGCGAGGCCGAGCTCGAAGTGCACGGGGCCTGGACCG 

AAGATCCAGCTCGCCCCGGCGCGAGCCCCGACGAAAAGCGTGTCGCCGTCGACGCCAGGGCCG 

TCGTCCCAGCCGGGCGATCCCACCGCGGTGTAGGTGTGTTTCCCGAAGGAACCCGCGAGCGAG 

AGTCGAAGTCCGACCGGCGCTCGCCACGCGACGCCCGCTGTCGCGCCGACGCCGCCGAAGCTC 

TCCCCGAAAGGCTTATCCCCTGTCTCGATGAAGCCACCCACCTCGATGACGCTGATGCGGTAC 

GTGAGCGCGAGATTGAGGTGCACCCCAGCGCTGTCCGAGCCCGAGTAGAGGCCGGCGCCCACC 

TGCACGCTGAAATCCATGCTCGGCGCGGATCCGCGCGCAGGAGCGACGCCAGGGGCGCTGCCC 

TCCTGCGCGCGGGCCGTCCCGACGCAAAGAAAGAGGGCTGTCGCGAAGAATCCAAGCGAGATC 

GATCGAAGTGAGCGCATGTCGGGCCCTGGAGCATCCGCTGTACCAGGTGCGTCGTATTCATGC 

GGCGCGCCGCCGCGCGCCGCCGCGCTGGCCTGTCCGACGCGAGATCACGAATCCGCCATCGCT 

CCCCTGGGCCGCCGGCCGCTCTGGTTCGCCTGCGGGCGTGCGCCGGCGCTCGTGTGGCCCATG 

GCAACCTTGTCGCGGTGTCGCTCGAACAGCACAGAGAGTATCGCGTCCGCAACAACCGCGCGA 

CCCGGCGAGACGCTCGTGGGGCCCCCTGCCTCCCCACTTCATCATAACGCCATCAGGAGCACT 

CGACATTTCATTTCTTCACCTCCACTGGCTGAGGGCGACGGTGCTCGTCATCGGCCGGTTGCT 

CTGGCGGTTGCTCTGGCGGGGTTTCTGACGCCCGGAACTAACGCTTCGAGCGCTCCCCCTTGC 

TCTCCCGTTCCTTCAGCTCCTCCAGCAGGTCGTCGAGGCGCTCGTAGCTGCCTTCCCAGAAGC 

GGCGGTAGTTGTCGAGCCAGCCGCTGGCGTCCTCGAGCGGCTTGGCCTCGATCCGACAAGGCC 

TCCGCTGCGCGTCGCGGCCGCGCGAGATCAGGCCCGCTCGCTCCAGCACCTTGAGGTGCTTGG 

AGATCGCGGGCTGGCTCATCGCGAACGGCTTCGCCAGCTCGGTCACCGACGCCTCGCCGGACG 

CGAGGCGCGCGAGGATCGCTCGCCGTGTCGGATCGGCGAGCGCAGCGAACGTTGCGTCGAGGC 

GCTCGGACGGGGTCATTGCATAACTCCTTGGTATAAAAACCAGTTAGTTATACAACCTGGGGC 

CCGGGCGGTCAAGCCTCCAGGCGATGGCGGTTCGGCCCGGGGGCTCCGCTCGCGGCACGCGCG 

CCGCGCGGCTACGTGCGCGGCGCGGTGAGCACGTCCTGCAGCGTGGCGCCGACCACGGGCTTG 

GTCAGGTGCAGGTCGAAGCCGGCCCGCCTGGACCTGGCCTGATCGTCGGGCCCGCCGTAGCCC 

GAGAGCGCCACCAGGTAGAGCGCTTCGCCGCCGGGCGCGGCCCGCGCCCGGCGCGCGACCTCA 

TAACCGTCGATGCCGGGCAAGCCGATGTCCACGAAGGCCACCTCGGGGCGCAGCTCCAGAAGC 

TTCTTCACGCCCTCCAGCCCGTCCACCGCCACCGTCACCTCGTGCCCCAGCGCCTCGATGTAC 

GCCCGCATCACCCGGCGCACGTCCTCCGCGTCCTCCACGACGAGCACCCGGCGCCGGTCAGCC 

GCCGCCTCGGGCGCCTCGGCGCGCTGCGCCGGAGGCGGCGGCGGCTCGTCGCGCTGCGCCGGA 
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GGCGGCCCCTCGCGCGGCGGGGGC3GCCCGGCGCTCGGGGCAGGCTGCGGCGCCGCCCCGGGG 

CCGAGCGGCAGGCGCACGGTGAACICGCTGCCCTGGCCCGGCCCGGCGCTCGCCGCGGCCACG 

CTGCCGCCGTGCAGTTCCAGGAGCC3CCGCACCAGCGTGAGCCCGAGCCCCAGCCCGCCCGTG 

CTCCGGTCGATGGTCTGGTCGACCrGCGTGAACAGATCGAACACCTTCTCGAGCATCGCCGCC 

GGGATGCCGCGGCCCGTGTCGCGCACCCGCAGCACGGCCTCGGGCGCGCCGACCGCCGCCTCG 

CGCGTGAGGCGCACCGAGATCGAGCCCCCCGGCGGGGTGTACTTCGCGGCGTTGGTCAGGAGG 

TTCGTCACCACCTGCTCCAGCCGCGTCGCGTCGGCCCGCATGCCGAAGTCCCCGGGCCCCACC 

GACAGCGACACGTCATGGCGCCGGGCCTCGACGGCCGGCCTCACCGCGGCGGCGGCGCTCTGC 

ACCACCGCCGCGAGATCGACGTCCTCGAGGCGCAGCTCCACCGTGCCCCGCGTGATGCGCGAC 

ACGTCGAGCAGATCGTCGACCAGCCGCACGAGGTGGCCCATCTGCCGCCGCGCGATCTCCCGG 

TAGCGCGCCGACGCGGGCCCGTCGrCGTCCGCGTCGTCGAGCAGCGTCAGCGACAGGCTGATC 

GAGGCCATCGGGTTCCGGAGCTCGTGCGCGAGCATCGCGAGGAACTCGTCCTTGCGCTGATCG 

GCGAGCTTCAGCGCCTCGACGAGC3CCTCCACGCGCCTCCGGGCGCGCACCTGGTCGGTCACG 

TCGAACGCGAACACGAAGACGCCCTCGACCGCCCCGTCGCGATCGCGCATCGGCTGGTAGACG 

AAGTTGAAGAACACCTCCTCCGTCG7GCCGTCGCCCCGGCGATCGAGCCGCACCGGGAGCTCC 

TTGCCGACGATGGGCTCGCCGGTGCGGACCACCGCGTCGAGGAGCTCCCAGATGCCCTGTCCC 

TCGAGCTCGGGGAGGGCGGCCCGC-ATGGGCTCGCCCACGAGCGATCGACCGCCGACGAGCCGC 

TGGTAGAGCGGGTTGACCACCTCGAAGACGTGCTCCGGCCCGCGGAGGATGGCGATGGGCCCC 

GGGGCCTGCATGAAGAGGTCGTTCAGGTACTGGCGCTGCCCCTCGGCCTCGCGCCGGCGGCGC 

GCGAGCTCGACGTGGATGCGGACCCGCGCGAGGAGCTCCTTCGCGGAGAACGGCTTCACGAGG 

AAGTCGTCGGCGCCGGCCTCGAGGCTGTCGACGCGCGCCTCCTCGCCCGCGCGCGCGGAGAGC 

ATCACCACGGCGACGCCGCGGGTGGGATCGTCGGCGCGCAGCGCCCTGAGCAGGCCGAAGCCG 

TCGAGCCGCGGCATCATCACGTCGGTGAGCACGAGATCCGGCGGGTGGGCGCGGGCGCGCTCC 

AGGGCGGCCCGACCGTCGGCCACG3CCTCCACCGTCCACCCCTCCGCCACGAGCAGCCGCAGC 

GCGTACTCGCGCATGTCCGCGTTGrCGTCGGCGACGAGGACGCGCCCCGGCAGCCTCCCGGCC 

GGCCCCTCGCCCGCCGGCCGGGAC3CCGGCGCCTGCTCGCCGCGGAGCCACTGCGCGGCCTCG 

TCGAGGAAGGGCGCGGCGTCCCGCCCCCCCGCGGCCGGCGCCGAGGCCGGCGCGAC . 



or its complementary strand, 
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(b) DNA-sequences which hybridise under stringent conditions 
to regions of DNA-sequences according to (a) encoding proteins 
or to fragments of said DNA-sequences, 

ic) DNA-sequences which hybridise to the DNA-sequences accord- 
ing to (a) and (b) because of a degeneration of the genetic 
code, 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isof unctional expression products. 

7. DNA sequence according to claims 1 to 5, wherein the DNA 
is selected from the group consisting of 

(a) the following DNA Sequence: 
Seq ID No 2 (>pEP0cos6 region) 

GGATCACCTGCGGCGCGATCGCCGACCTCGTGCTGGTGTTCGGCTCGCTGGATGAGAAGCCGG 
CGGCGCTACTGATAGAGACGGCGACGCCCGGGCTGCGGGTGGAGCGGTTGCGGGAGATGCTCG 
GCTTTCGGGCGGCCCACCTGGCGAAGCTGTCCTTCGACGGTTGCGAGGTCCCCGAGGCTCAGC 
TGATTGGCCGGCCCGGCTTTGCGCTGATGTATCTGGCCCCCTACGCCCTGGATTTCGGTCGGG 
TCAGCGTCGCCTGGGCCTGCCTGGGCATGATCCGCGCTTGCCTGGAGACCTGCGCACAGCACA 
TCCTCACCCGCCGCACCTTCGGCCACCTGCTAGCCGATCACGGCATGATCCAAACCCTGATCA 
CCAACCTGGGGATTCACCACCAGGCGACGCTGCTCCACACGCTGCAGGCCTGCCGCGCCAGGG 
ATCGCGGCGACGTGACCGCCTCCGAGGCCACCCTCGCCGCCAAATACCTCGCGTCGCGGACGG 
CGGTCCAGGAGACGACCAACGCGG-CCAGATCATGGGCGCGCTGGGCTGCGACGAGGAGGGCG 
CGATCGCCCGCCACTTCCGCGACGCCAAGACGACCGAAATCATCGAAGGCAGCAACCAGATCA 
TCGAGGCGCTGCTGGCCAAGAACATCGCCCGCGCCGGTCGCGACAACTATCGCCGCTTCCTCG 
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A7GCGGAAGTCGAGCCCGGTCGGGCCGGAGGCGCACCATGACGAGCGCGGTCCCGACGCGTCA 

AACCAGCCTGCTCGACGACTTCGAGCGCGTCGCCGACGTCGATCCAGAGCGGATCGCCGTCCA 

CGCGAGCGAGACGAGCCTGCGCTATGGCGACATGAATGCGCGCGCCAACCGCATTGCCCACGG 

GCTACGGGCGCGCGGGATCGGGCCCAATCAAATCGTGGCGGTGGCGATGGCCCGCACGCCCGA 

GCTGATGATCGTGCTGTACGGCATCCTCAAGGCCGGCGCGGCCTACATGCCCATCGCCCGCGA 

CGCGCCGCCGCTGCGCCGCGATCATATGCTGCGCGAGAGCCAGGCTGCTCTGATGATCGCCGA 

CGAAGAGATCGCGGGACTCGCGGCCCGGGTGCTGACGCCGGCCGACCCGTTCTTCGCGGCCAT 

GCCGGACCACAACCCCGAGCCGCGTCACGACCCGACCGACCTGATTTACGTCATCTACACCTC 

GGGCTCGACCGGCCAGCCCAAGGGCGTGGCCATGGAGCACCGCGCCGTGTGGAATCGCCTGAC 

TTGGATGCAGGCCCAGTATCCAATCGACACGCAGGACGTGATCCTCCAAAAGACGCCGATCGT 

CTTCGACGTGTCGGTCTGGGAGCTGTTCTGGTGGCCGCTGGCCGGCGCCTCGGTGGCCCTGCT 

GCCGCAATCCATGGAGAAGTTCCCCTGGGCGATATCGGCGACGGTGGCGCGGTGCGGGGTGAC 

GGTGATGCATTTCGTACCATCGATGCTGATGGCCTTCCTTCAGGTGGTGGCGGGGCGGCCCGA 

GATGGCGGACCAGATGAAGGGCCTGCGCTACGTCTTCTGCAGCGGCGAGGCCCTGGCGCCGGC 

CCACGTGTCAGCCTTTCAGGAGCACATCAACCGAGCGGGCAGCATCAGCTTGACCAACCTCTA 

TGGACCCACCGAGGCGGCGGTCGACGTCAGCTACTTCGACTGCCCGCCCGGCGCGTCACTCGC 

GCGGGTGCCGATCGGACGAGCGATCACCGGCATCCAGCTGCTGGTCATGCGCGACGGCGTGCC 

TCAGCCGCCCGGCGTCGAGGGTGAGCTCGCCATCGGCGGCGTTGGTTTGGCGCGCGGCTACAT 

CTCACGGCCAGACCTGACCGCCGACCGGTTCGTGCCGCATCCAGGCGGCGACGGCCAGCGGCT 

CTACCGCACCGGCGATCTGGTGCGCAGGGACGCGGACGGCGAGCTGGTCTTCCTGGGGCGCAT 

CGACCATCAGGTGAAAATTCGCGGTCTGCGCATCGAGCCCGGGGAAATCGAGGCCCAGATCAG 

CGCCCATCCCGATGTGGCCGACTGCGCGCTGATTATCGAGCAGGACTCGGAAACCCTGCCCAA 

GCTGACCGCCTACATTGTCGTGGCGCGACCGGGCTTGACCCGGAAGGCGCTGCTACAGTTCCT 

GGGCGCGCGGCTGCCCGACTACATGCTCCCGAACCGCTTCCTGACCCTCACGGAGCTGCCCGT 

GACCGCCAACGGTAAGCGCGACTGGCGCGCGCTGCTCGGCCCGCTCGAGACCCTGCCTCTCCC 

TTTCTCCTGAATCCAACCAATACGAGGGATTCATGTTACACCCGATTCCCACCGACCGTTTCG 

CCCTGAGCCGACCGCTCTTTCGCGGGTACCTCGCGCACGATCCGATCGTGCAGGGCGTGCTGG 

CGGGCGACCATCCAGGCTGGGTCCTGGTGGACCGCGAGCCCGAGCCGCGCACGGCGCTGCTGT 

GGGCCTTTTCCGATCGGCTCTTCTGCGTGGGCGCAGCTGACACGCTGACCCCGCACGCGCTGG 

CCGAGCTCTTCCACGACCGACTGATCCCCCAGGCCCGTAAGATCGGGCAGCCGTTTTTCCAGG 
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TTCAGGGCGAGACGGTCGACACCTGGTCGGACCACCTGCATCAGGTGTCGCCGCACGCGACAG 

TCTCCTTCCGCCAGGCATTCCGCTTCGACCGCGACCTCTTCGAGCGGCTGCCAACCAAGCCGG 

AGCTGGCAGAGGCGCGGCTCG7GCCAATCGACGCGCGGCTGCTGGCCGAACAGGCTGATCTGC 

GCGAGCGGATACTGGCCTCCTGGTCCAGCGAAGCTGCCTTCCATGCGCGCGGTTTCGGCTTCT 

GCTACCGCGTAGGTGACCAGCTGCCGAGCGTG7GCCTGGCATCGCACGTAGGCGGCGGCGCGG 

CCGAGCTGAGCATCAACACCGAGCTCGAAGCGCGCAATCGAGGTATGGCAACGCGGCTGTGCC 

GGCGTTTCATCGCCGAATCGCTGCAGCGCGGCCTGACGCCTTGCTGGGGCACCGAGACCTTTC 

GCCTGCCGTCAATCGCGCTGGCCCAGAAGCTCGGTTTCATCCCGACCTTCACCTTCCCCACCT 

ACTGCTTCGCGACCGGCACCGAACAGCCGGACGACAACTTCCTAGGCGAGCTGTACTACAGGG 

AATCGCGCATCGCCGGAAGTGGGACCGATGAGCCGCAAGCGGTTCGGCTGGCGCGGGGTTGGA 

GCCTGGCCGGCGACACCGAGCGTGCCGCGAGCTTCGCCGCACGCGCCCTGGCCGAAGGGTGGG 

CCGGCCACTCGACTCTGGCCACCGATCCGGATTTCGCCCGATTGCGCGCCAGCGCCGCCTGGC 

CCCGCCTCAATGTCCCTTGAAAGGTCACGTGGACTCATGATGTCCCCTTGAAAGGTCACACTC 

CGAGTCATGATGATTTGTCACTCCCACCGCTTCATTTTCCTCCACGTTCCCAAGGTCGCCGGC 

ACAAGCGTCAAGGACGTCCTCGGCCAAGAGCTATTCCAGGAGGACCAGGTCACGTTCCAGATC 

GCTCCCAATCCCCACTACCCACCTGAATGGACTGCGCCTTACGAGGAGCACATTATTGCCGCT 

GAATTGAAGAGCCAGTTGGCGCCGGAAATTTGGGACGATTACTTCAAGTTCGCCTTCGTGCGC 

CATCCGCTCGACTGGGCGGTCTCCAATTACTTCTTCTTCCTGCGCGACCGCAAAGGCCATCCG 

GCCCACGAATTCCTGGAGCGGAAGC-GCTTCGCCGGTACCATGGACATGTTTTTCGGAGCGGCC 

GGGCGCCATCCGCTGGTCGCCGGCATGCGCTTCAGCCAATGGGAGTTCTTGTGCGACAGCGAG 

GGCCGGACGCTGGTGGACTTCGTTGGCAAGTACGAGCGGCTCGAGCAGGACTTCGCCGCCGTG 

TGTATCCGCATCGGGCTGACCCCGCCCGACTTGCCGTGCCTCAACCAGACTCGCCACCAATCC 

TTTACCAGTTACTACGACGAGGCTTTGATGCGCCAAGTCAGCCGCGCGTTAGCTCGCGATTTC 

GAAATTTTTGATTATGCCTGAGGCGGACCCGTTGCTTCGCCACCGGTGGATTATTCGATAAGT 

TATTATATTTTCAGTTGATCATGTGAATGTCGATCCAGCCAACGAGGAGGATACCTCCGCGTG 

CGGCTATGGGGGCGCAGAGGTCACGACTACGTGTAGAAATTTGTCGAACACACCACTAGGTGC 

CACCGATTGGGAGCnTGACTTGAAGATGAAAGTGGACAAGCGGAATGTCGACGACATTCTCG 

GACTCACTCCGACACAGACAGGCATCTTGTACCACTACCTGCTGGACCCGCAGGCCGACGCCT 

ATTTCGAACAATTGACGCTGCACCTGGAGGGGCCGCTCGACGTAGCGCGCTTCCGCCGCGCCT 

GGGAGCGCGTGGTGGCGGCTCACGACCAGCTGCGCGCCGTGTTTCGCTGGCAAGGGATCGAAC 
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ACCCGGTGCAGATCATCCTCAAGCAGCACGTGCCGGACCTGGAGTTGGCGGAGGTCCCGCGCG 

ACGCCGATCCGGCAGCCTTCCTGGCGCAATGGGTCGCGGCCGACCGGGCGCGCAAGTTCGACT 

TCGAGACGGTGCCCTTTCGCATCGGCCTCTGCCGGACTGATACCCAACATCACGTGATGCTGC 

TCAGCAATCACCATATCCTGATGGACGGTTGGAG7ACGGGCCTGATTCTGCGGGACTTCCTCG 

CCTGCTACGGCGACTCCGAAAACTGGCGGCCACGCACCCGAACGCACTTCAAGGCGTTCATCA 

AGTGGCACCAGAACCGGCCACGCCGGGGCGAGGAGCGATTTTGGCGCGACCTGTTGCGCGATG 

CGCCCGACGGCGGCTTTCCCCGCCT3GGCGTCGAAGAAGGCACCCGCCACTCGCTTGACTXCG 

GCGCCCGCAGCCGCGCTCTCGACGACCGCTTGACCCAAGGCTTGCGCGACATGGCTCGCGACC 

TCGACGTCACCCTCGCCGCGATGCTCCATACCGCTTGGGGCCTTCTACTCCAGCGCTACCAGA 

ACAGCTGCGAAGTGATATTCGGGACCACCGTTTCCGGCCGCAACGTCGAGCTCGCCGGCCTCG 

ACGAGGTGGTCGGCTTGTTCATCAACACGATTCCGTTCCGCTTCTCGGCCGCGGCCGCGACGA 

CGCCCGTCGAGGCCTTCCGTGCGGTACAGCGCAATCTGCTGGCGAGAAGCGAGTTCGAAGCCA 

CCCCGCTGGTGGACATCAAGGGCTGGAGTGGTCTCGGTCCGGGCGCGGAACTGTTCGACACCA 

TCCTGGTCATCGAGAACTATCCCTTGGACCGCGCTATCTTCGAGAGTGATTCCAGCCTGCGGT 

TGACCGACCACCAAATCTTCGAGCGCACCAATTACGGGCTGACCCTGACCATCGAGACCTTCA 

GCCGGTTGCACGTGACGCTAGCCCATCGCCGTGACCTGCTGGGCGACGCGGCCGCTGAGCGAA 

TGCTAGATCATTTCACCGGCCTGCTCCAAGCCATGCTGCGCTTCCCTCACCAGCCGTTCGCGC 

GCCTCGAGATGAAAAGCGAACACGAGGCCCACCGCGTCCTGCACCAACTCAACCAAACGCGTC 

AGCCGCTGCCGTCCCAATCGGCTTTCCACCAGTTGTTCTTCGAGCAGGCCCAGGCCGATGGGG 

CACGACCGGCGCTGTGGTGCGGCGCCACGCGCTGGACCTACGGCCAGCTGCTGGAACGTGCCC 

TGCGTCTGGCGGGACGGCTGCAGGAAGCCGGCTTCGCCCGAGGCGATGTCGCCGCCGTCAGCC 

TCGGCCCGGTTCCGGATCTGATTCCCGGTTTGCTGGGCCCGCTGTTCGCCGGCGGCGCCTACC 

TGCCGCTCGATCCCACCCTGCCGGCCCAGCGCTCGCGGTTCATCCTCGACGATGCCGGTTGCC 

GCTTCCTGATCAGCGACGCGCCACTCGCGGGGCCCACGCCGATCCATCCGGACCCTGCCGGCG 

CCAGCCCCGTTGACGTCATTTTTGCCTGTCAGGACGGCGCCGCGCAGCCCGCCTACCTGATCT 

ACACCTCGGGCTCCACCGGCCAGCCCAAAGGCGTCTGGGTTAGCCACCGCAACCTGATCAACT 

TCCTGACGGGCATGAGCGCAATCCTGCCGGTCGCGGCCGACGACGTGTTCCTCTCGCTGACTA 

CCGTGTCGTTCGACATTTTCGGGCTCGAGACGTGGTTCCCGCTCAGCCGCGGCTGCACGATCG 

TCTTGGGCACGCGCGCCGAGCAGTTGGACCCGGCCGCGGCTGCCAAGGCCATCTCCTGCCATG 

GCGTCACGGTTTACCAGGCGACGCCATCGCGACTCCAACTTCAACTGGAGCACCCCACATTTG 
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TCCGCGCCATCGGCTCCCTGACGACCCTGCTGGTAGGCGGCGAACCCCTCCCAGCCGAGCTGC 

TGCGGCGCGTACGCGAAGTGACCGATGCGCGTATCTTCAACCTCTACGGTCCCACCGAAACCA 

CCATCTGGTCCACAGCCGGGGAGGTCACCGCGGCGGACGTCCCGGATATCGGCCGCCCGATCG 

CAAATACCGGCGTTTTCCTTCTGGCGCGAGACGGCTCGATCCAGCCGCCGGGCCTGGTGGGCG 

AGTTGTGCATCGCCGGCGAGGGCGTGGCGTTGGGCTACCACCGACGGCCGGACCTGAACCGAG 

AACGGTTTCGCGAGATTCCGCCGGGCCGCCTGCCCTTTGCCGGCAAGCTCTACCACACCGGCG 

ACCTGGCCCGCTGGACCGAAGACGGACGGCTCCTCTGCCTGGGCCGTCTGGACGACCAGCTCA 

AAGTGCGCGGCCATCGCGTCGAGCCGGGCGAGATCGAGGCAGTGATGGCGCGCCACCCGGCGG 

TCACGCAGGCGGTGGTCGTCACGCGGCCGCGCAACGGCGAGCCGGTCTTGGTCGGGTTCTGGA 

CTGCGGAAGGTGAGCCGATGCCAGAGGAAGCGCTGAGCGCTTACCTGGCCGACCGACTGCCGA 

GCTACATGGTACCCGAACGGTGCArCCTCATGAAGGCCATGCCGCTAACCGGCAACGGCAAGA 

TCGACCGGCGCGCCCTACCCAATCCCTTCGCCTTGACCGAGTCGACCCGGCAGGCGGCGCCGC 

GCACCTTGGCCCGCACCGCCGGCGAGCATCGGGTTGCCGAGCTGTGGCAGGCCTTGTTGCGAC 

GCGAGGCGATCGGCTTGGACGAACCCTTTTTTCAGGCCGGCGGGAACTCATTCGGCTTGATTC 

GGCTTCACGCCAAGCTGGAATCCGCCTTCGGGAAGTCGTTCCCGATCACCGATTTGTTCCAGC 

ATACCAGTATTCGCAGCCAGGCAGAAATGCTGAGCGGCTCGTCCGTCGAGGCGCCGCTCGCGG 

GAGCCGTGCCGCAACCCCCGGCCGCCGCCGCCCAAGTTGCCTCCTCGGCAGCTAAATCCCCAG 

GGGAGCGCGGCGCGGCAGCGACGTCGAGCGGCCTGACCGCGCAACCGCCCCAACCCCACTTCC 

GGCCCATCGCCGTTATCGGCCTCGCCGGCCGATTCCCCGCCGCACCCGACCTCGACGCCTTCC 

TTGAACTGCTCACGGAGGGTCGCTGCGGCATTCGCtTCTTCAGCCAAGCCGAGCTGCGCGACG 

AGGGTCTCGACGCGAATCGAATCGCGTGTCATAACTATGTCCCGGCCAAAGGTTTCCTCGACC 

GGGCCGACCACTTTGATGCCGACTTCTTCGGCATCCCGCCGCGCGACGCAGAAATCACCGATC 

CGCAAATTCGGCTTCTGCTTGAGTGCTGCTGGAACGCGCTGGAGCATGCCGGCTACCCGCCCG 

GCGGCGGCGAGATCGGGCTCTTCGCCGGCTCCTCGGCCAACTATCACTGGCTCGAATACGTGG 

GCATTTCCGAGGAGAGCAGCAATCGATTCGCCGTCATGATTCAAAACGAAAAGGACTACCTGG 

CCACGCGGATCGCCTACCAGCTCGATTTGAAGGGCATTGCCGTCACCGTGCAAACGGCCTGCT 

CGTCGTCGCTGACCGCGGTCGAGCTGGCCTGCGATGCGTTACACGCCGGCCGCGTGACCATGG 

CTTTGGCTGGTGGCGTTGGTCTGACCTATCCGTTGCGCGCCGGATACCTGCACGAGGATGGAA 

TGATCTTCTCCCCCGACGGTCGGTGCCGGGCCTTCGACGCCCAGGCGGCCGGCACGGTCTGCG 

GCAACGGT.CTGGGCATGGTGGTGC7GAAACAGCTCGACGCGGCGCTGGCCGACGGCGATGCCA 
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TCCACGCTGTGATTAAGGGCATCGCGGCCAACAACGACGGCGCGGCCAAGATCGGCTACACGG 

CGCCCTCGCAGAACGGTCAGGCGCGGGTGATCCGCGCCGCCCATAGGCTCGCCCAAGTCGCGC 

CGGAGACCATCGGCTATGTAGAAGCCCACGGTTCGGGCACGCCGCTGGGCGATCCGATCGAGG 

TGGCGGGCCTGACCGAGGCCTTTGACAGCCCGCGTCGCGGCTTCTGCGCCTTGGGTTCGGTCA 

AGTCGAATGTGGGTCATTTGGATGCGGCAGCGGGCATCGCGGGTTTCATCAAGGCGGTGCTCT 

CGCTGTCCCATCGGACCCTGTTCGCCAGCCTCCACGTCGACACGCCCAACCCGCAGATCCCGT 

TCGCCGACGGTCCGTTCCAGGTCAACACGGAGACCCGGCCCTGGCCAGCTGCCGACCATCCCC 

GCCGCGCCGGCGTCAGCTCCTTCGGCATCGGCGGCACCAACGTGCACGCCGTCCTGGAAGAGG 

CGCCGCAGTTGGCCGAGCACGCGGGGCGGCGGCGCGAGCGGCAGCTGTTCCTGGTCTCGGCGC 

GGACTGCAGCCGATCTGGAGCGACGCACCGCGGCGCTGGTCCGCCACCTGGCCGCGCATCCGG 

ACCTCGCACCAGATGACGTTGCCTTTACCTTGCACGCGGGCCGCAAACCGATGACCCACCGTC 

GTTTCCTC-GTCGCCGCCGACCTCGCGGAAGCCGCCGCGCGTCTGGCCGAGCCCGATCCAGTCA 

AATCCGCCGCGGCGCGCGCCGACCGCTGCCAGGTCTGGATGTTCGCCGGTCTCGGCTCTCAAT 

ACCCCGGCATGTGTGGCGGCCTCTATCGCACCGAGCCGGCCTTTCGCGAGCAAGTCGACCGCT 

GTTTCGACCTCCTCGCGCCGCGTTGCGATTTGAAGCCCTCGCTCTTCCCCGAGCCCGATCAGG 

CCATCGACGCATCAGCCCTCGCGGCCATCGACACCGCCCAGATCGCCGTCTTCGTCTGCGAAT 

ACGCGCTCGCACGGATGCTGGAAGGCTGGGGGCTGCGTCCGGATCGGCTGATCGGTTACAGTT 

TCGGCGAATACGTGGCCGCCTGCCTGGCCGGCGTCTTCTCCCTGCCCGACGCCTTGGCAATCG 

TCCGCGAGCGTGGCCGGATCCTGGCGGCGGCCGAGCCGGGCGCGATGGTCAGCGTGCCCCTTC 

CGGCCGAGCGCGTCGCGTCGCTGCTGGAGCCGCCGCTTGCCTTGGCCATTGACAACGGCCCCT 

CATGCGTGGTGTCCGGGCCGGTCGAACCGGTGCGCACCTTCACCGCTCGCATGAAGCGGGACC 

GGGTCTGGGTGACGCCGCTCCAGGCCGAGCGCCCGATGCATTCGCCGCTGATGGCCGAGGCCG 

GCGGCTCACTGCGCGCCATGTTGGCCGGGTTCCGCCTGAATGCGCCGCGAATCCCGATCTTAA 

GCAATGTTACAGGAACCTACCTAACCGACGAGCAGGCCCGAGACCCCGATTACTGGGCCCGTC 

ACC?GTGCGGCAACGTTCGCTTCGCCGACGGTGTGCGAACCTTGTTGGCCGAGCGCGATCCGG 

TGTTCCTTGAATTCGGGCCGGGCCGCGATCTGAGCTCCTTGGTGCGCCACCAGATGCCGGAAG 

GCGCCGACGAGCCGATCGCACTGATCCGTCATCGCGAAGATCCGGTGCGCGACGAAGACCTCC 

TGCTCGATGGCTTGGGCCGCTGCTTCCTGCGTGGGGCGACCCTCCACGGGCAGGCCTTGTACG 

CCGGCCGAGGCTGCCGCCGCGTGCCGCTGCCCGGTTACCCGTTCCAC'GGTCCACGCTGCATGC 

CGGCCCGCGCCGGACTGCCCGGCCTGGCGCGACCGACCGTGGGAGCGACCACCATCAGCTACC 
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GACCAGCCTGGAAGCGGGCGCCGCGCTTGGCGGCTGTCGAATCGCTCGCGCCGCAATCCTGGT 

TGGTATTCAGCGACGGCAGCGAATTGGCGGGCGAGCTC<3TGGCCGGCCTGCGCGCTTCCGGTT 

GCGCGACCACCCTCGTCGAAGGTGGGCTGGCGTTCGCGCGCTTCGCGGGCGGCTTCCGCGCGA 

ATCCCCGCGAGGAACAAGATCTCGCACAGCTGTTCGCGACCCTGTCGGCCGAAGCGATGCTGC 

CCACCCACATCCTGCACCTGCTCAGCCTGCCGTCGCCGGAGCGCGACTCGCCGCTGGCGCGCC 

TGGAGCACCTCACCGAGCTGGGCTTCCACCATCTGCTGGCCCTGGCCCGCCAACTGGAGGCGG 

TCGGCGCCCCCGAGGTCCGCCTCGCCGTGGTGACAACCGGCCTGGCGGCGATTGGCGGCGAGT 

CCGAGCTGCGGCCCGAGGTCGGGCTGTTGCGGGGACCTGTCCGCGTGATTCCCTTTGAATTCC 

CGAACTTGCGGCTGCGCCTGATCGACCTCGACTCGGCCGATCCCATCTGGCGTAGCGGTTGTG 

AGCCGTTGCTGCGCGAAATGGGCGCTGCCCCGGGACCTGAAGAAATCGCGCTGCGCGGCACCA 

GCCGTTGGGAGTTGGGCTACGAGCCGGTCGAGGGGGGCACCGTGAGCACCATCTCCTCGCGAC 

TGCGCGAGGGCGGCGTCTATCTGATCACCGGTGGCCTCGGCGGCCTGGGTCTGGCCTTGGCCC 

GTCACCTCGCCCGGAAGTACCGCGCCACCCTGATCCTCGCTGGCCGGCGAGGCGCGCCGGCGC 

GCGAGCTCTGGCACCAGGCGCCAGCGGAGTTCGTACCGGTCGCAGCTGCGATCGCACAGATGG 

AGGAGTGTGGCGCCCGCGTGATTCCCGTCGCGCTCGACGTCACCGACGCCGACCAAGTGAACG 

CGTTGTTCGCCACCATAGAAGCTACGGTCGGCAAGATTGAAGGCGTTTTCCACATGGCTGGCA 

TCGTTGACGGCGGCATCATTCGAACGCGCACGCGCGCTGCCAGCGACGCCGTGCTGGCGCCCA 

AAACGGTCGGAACCTGGATTCTCGATCGGGCTCTCCGCGGCGCCGGTGGCCGCTTCCTGGTGC 

TGTACTCCTCGATCAACGCGGTCGTCGCGCCCTTCGGCCAGGTTGCCTACGCCGCCGCCAACG 

CCTTCCTCGACGCCTTCGCCAGCGCCCACGAACACGACGAGCGTCTTTTCCGCGTCAGCATCG 

GTTGGGACACCTGGCGCGAGGCCGGCATGGCCGTCGATGCCGCCCGCGCCCGCGGCGACCAGG* 

CCCCGCTCGAAGGGCTTAGCGACGAGCAGGGCTTGCGCCTGCTCGAAAGCGCCTTGGTCGGTT 

GCGAACCGCGACTCCTCGTCTCCATCAGCGAACTGCGCGCTCGACTAGCCGAGCATCATCGCA 

ACGGCGGCATTCCCCGGTTGCTCGGGCCCCGCGCCAACGAGGCGGGTGCAGCTGATTCCGGCG 

AGGAGGGCGCCACGCAAGACGCGTC3CCGGCCCGTCGCGCCCGTCCCGATCTGGTCGTGGCCT 

TCGCGCCGGCCGGCAACGAGCTGGAGCGCCGGATCGTGGCCATCATCGGCGCCTACCTGCGGC 

TCGGTCAGGTGGGCGTCGACGACAACTTCAACGATTTGGGCGCCACCTCGCTCGACCTCATCC 

AGATCGCCCAACGCCTCGGTCGCGAGTTGGGCCGCGATGTCCCTGTCGTCTCGCTCTACCAAC 

ACCGCACCGTACGCGGGCTGAGCCGCTTCCTCGGCGGCGCGCTCCAATCCGCGCGGTCCGGCG 

TCCCGACGGGCGCTGCCGCACCGG3CGCCGCCACGCCGGGGGTTGCCACCCCGCCGCGGCCAC 
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AACCGTCGCGCCAGCACCTGGAAAAACGCCGTCAATTGAGGAAAAAAGGGGGGCCTTCCCATC 

ATGAGTGAAGTATCCATTCGCCCCGGCTTGGACATCGCGGTCATCGGCATGGCCTGCCGCTTT 

CCCGGTGCCCGCAACCTCGCCGAGTATTGGGCCAACCTGATCGAAGGCCTCGAAACGCTCAGC 

TTCTTCAGCGAAGAGGAGCTGCGTGAGGCCGGCTGCGATCCGGTCCAACTGGCCCAGCACAAC 

TACGTGCGCACCAAGGGCCTGCTCrCTGACGCAGACCGTTTCGACGCCGATTTTTTTGGTTAT 

TCCCCGCGCGAAGCCCAGGTGATGGACCCCCAGATCCGCGTCTTCCACGAGGTCTGTTGGCAG 

GCGCTGGAGCACGCGGGCTACAACCCGCATCGCCACACCGGCACGATCGGCCTGTTCGCCGGC 

GCCGCGCCCAACGTTTTTTGGGAGTTTCTCTCCTATCGGTCCGATGCCGCCAATTTAGGCAAC 

TTCACGCTGGGCCTGCACAACAACAAGGACTACCTGAGCTCGCGCATCGCCTACAACTTCAAC 

CTGACAGGGCCCAGCTACACCCTGTTCACCGCCTGCTCGACCTCGATGGTCGCCATCCACCAG 

GCCGTCCAGGCGCTGCTCAACGGCGAATGCGACCTGTGCATGGCCGGCTCGGTCTCCATTACG 

CTGCCACTGGTTGCCGGCTACACCTACACGCCGGGCATGATCGTCTCGCCCGACGGCCATTGC 

CGCACCTTCGACGCAGGCGCCAATGGCACTGTCTACGGCGACGGGGCCGGCGTGGTCGTTCTC 

AAGCGGGCCGAGGATGCGTTGGCCGACGGCGACCACATATTTGCGCTCATCAAGGGCTCGGCG 

CTCAACAACGATGGCAGTCGCAAGACCGGCTACACCGCGCCCAGCGTGCAGGGGCAGGTGGAG 

GTGATCCGCGCGGCGATGAACCTGGCGGAGGTCGAGCCGGAGGCGATCAGCTACGTGGAAACC 

CACGGGACGGGCACCACGGTGGGCGATCCGCTGGAGTTCGAGGCGCTAAAGGAGGCCTTCGGA 

GGTGGCTGCAAGGCCTTCTGTGGArTGGGTTCGGTCAAGCCGAACATCGGCCATCTGGACGTG 

ACGTCGGGGATCGCGAGCTTCATCAAGCTGGTCCTGGCGCTGGAGCACCGCATCCTACCGCCC 

ACGCTCCACTTCCAACTGCCCAACCCGAAGATGGATGTGGTCGATAGCCCCTTCTACATCGTG 

GCTGAGCGCGAACCCTGGCGCGAAGATCTGCTGCCGCGTCGGGCCGGTGTCAGCGCGTTCGGT 

CTGGGTGGCACCAACGTCCACATGATTTTGGAGGAGTTTCAGCGCGAACCGGCGGCGAACAGC 

GCGCGCACGCGCCACCTGACGGTGCTGACGGCGCGGTCGCCGCAAGCCCTGGCGCAGCTGGCG 

GCCAACCTCGCCGAACACCTGCGCGAACACCCCGAGTTGGCGCTGGCCGATGTGGCCCATACG 

CTGCTGCACGGCCGCAAGCCACATCCATTCGCGCGCATCCTGGTGGCGACCGATACGACGGCG 

GCGATCGACGCCTTGATGAACGACCGCGATCCGCGAACGCGTTTCTTCGAAGCGACCGGGCGC 

GGCGAGTCGGTGATCCTGTGTTTTGACGAAACGCCGCCGGAGCCGCGAAGCGCCCGCTACCTC 

TGGGATCACGAGCCGCTTTATCGCGCGGCGGCGACGTCGTGCTTGGCTGGTGAGGTCGCCGAC 

CCGGATCTGGAAGGCTGCTTTACT3CCCTGATCGCCGAGCAGGGCGCGGCAGCCGCCTTTTGC 

CACCAATACGCGCTGGCCGGATGGrTGCTGGCCATGGGGTTGACCCCGTCGGCGTTGATCGGC 



wo 00/22139 



PCTAiS99/23535 



74 



GTGGGCCAGGGCGAGTGGGTAGCAGCGGCGCTCGCGGAGGTGTTCCCGCCATCGGCCTGCTTG 

CGCTGGATTAGGTTCGGCGAACGGCTCCCGCAGCCGCGCGATCAACGGATTCCGTTTCTCTCC 

AATTTCTCTGGAAACTGGATCGTTGGGCGTGAGTTGGCCGACCCGGATTACCCCAGAAAGCAG 

AAGGGTAAGCGCTGCATGAAGCGCCGTCGGTCCCAACCTCGGTCAGCTGGTGCAGGATGGGGG 

CGATGGAACCGGCTCGGTCAGCTCGTCGCGCGCTGCTCTTCCGCGGGAAGCGGAGGCGGGACG 

GTGATCGGCCCGAGGGCGAGGTTCATCTCGTCGTCGACGAGCCGGGCGCGGGTGCGCGCCCAG 

TACCTGGGGGCGAGCTCGAGGTAGCGGTCCCGCGGCCAGTAGGGCATCGCGCGAATGACGTCG 

GCCAGGTAGGCCTCCGGGTCGAGCCCGTGCAGCTTGCAGCTCGCCACGAGCGAGAAGAGGTTG 

GCCGCGGCGGAGGCGTGGTCGTCGCTGCCGAAGAAGAGCCAGGACTTTCTCGCAACCGCAATG 

GATCGCAGCGCTCGCTCGCTGGCGTTGTTCTCCAGGCGCAGCCGACCGTCGTCGAGGAAGCGC 

CGCAACGGCTGCTCTTGGTTGAGGGCGTAGCCGAGCGCGGTGGAGACCAGGCCGCGCTCGCGG 

GGACGAGCGTGCTCGGCCCTGGCCCAGGCAAAGAACGCGTCGACCAGAGGGCGGACGACGACA 

TCGCGACGCACCTTGCGCTGCGCGGGCGGCAGGTCCGCCAGCGCGCGATCGGCGGCAAAGAGG 

GCGTTGATGCGCCGCAGCCCCTCGACACCGAGCTCGTGCTTGCAGACCGCCGCCTCCCAGAAG 

TTGGTACGGCAATGCGACCAGCATCCGACTTCGGTCGGGGGCGGACCGCGCTTCTCGTCGGCA 

GCAGCGCCTCTTGGTGGTGTGCCGCGGAAGAGGGCGTCATAGATGGCGTGAGCGTCAGCTTGA 

ATATACCGAGAGAAGCCGCGGAACATCTCGCAGACCGCGGCGCTGGTATGCTTGGGCTGGTAC 

TCGAAGAAGACGTGATCCTTGTCCGCGAGGACGACGAAGAAGTGTCCCTTGCGGCACGGCCCG 

GGCTTCTTGTCCTTGCGCTCCTGGATGGGCCCAGGCTGGACGGAGACCCCGGTGGCGTCCGTG 

GACAGGCAGAAGGCGGTCTCGAAGGCCTCTTTGCGCGCGGCCTCGACGATGGCGCCCAGGGTC 

GCACCGACGTCTTCGGCGTAGCGGCACATCGTGCCGCGATCGAGCGACGCGCCCTGAAGCTCC 

AGCTGCTGCTCCAGTCGATAGAACGGGACGCCGAGCAGGTACTTGCTGGTGAGGATGTGCGCA 

ATCATCGACGGCGCGAGGAACGACCGCCGGAACAACTCCTTCGGAAGCGGCGTCGTGATGAAG 

ACCGTGCAGGTCTCGCCCTTCGGCGCCGGCGGCGGCGCGTCGAGCGAGGGCGCGTCGAGCGCT 

GrGGAGGAAG'CGCTCGGCTCGCCGGCCGCTGCCGTGTCCTCCGGGCTGACGCTCGCAGCGGGC 

GTCGGCGTCGGGGCTTCTCTCGCGACGACCTGGAGCGGGGCCGCTTCCTCCTCGCCCGAACTG 

CTCGCATCCGTGACGGACCGCTCGGCCTTGTACACGACGCGTGCGAGCACGATGCGGCGCATT 

CCGCCGCGCTCGTAGCCGAGTCGCGAGGTCTCCTCGACCCCGATGCGCGTCGCCGTCGCATCG 

AGCTCGGGGCAGGAGAGCTCGATGCGGACGACGGGCAGGTCGGACTCGGACAGGTCGCGACGG 

CCCTTGCCGCCGGACCTTCGTTTCGGCCCCTTGGGGTCGTCGTGCTGCCGCTCGTCGCCTGTA 
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TTGCGCTCGGCGGCGTCGAGTGCCTTCGCGAGGCGCTGGACCTCGAGGAACATCGAGTCGAAC 

GCCAGCTGCTCCGCGCTCACCTCGGCGCGCTCCGCCTTGGCCACGAACAGTCGACGTCGCAGA 

AGCTGCAGCTGCTCGAGCGCACGGGTGTAGGCGCGCCGAAGCTGCGCGAGCGCATCGCGCGCT 

CCCACGAGCTCGCTCTTTGCCGCGGCGAGCTCCGCTTCGAGCTGCGCGATGCGCTGCTGCTCG 

GCCGAGAGCGTCGGCTTGGCGGCGGCGTCGTGCACGACGCCGCTCTACGTAAGCCGCGCGTAC 

TTGTCGAGCGAATTCGTGCGGCTCAGTGGACGCGGCGCGGTGCGCGCCTTCGCGGTTTGGACG 

TGGGCGCGATCTCGATGCCGTCGAGCAGCGTCTCGAGCGTGGCGTCGTCCACCTCGACGTGCG 

TGGCGCCCTCGGTCGGGGGGTCGGGAAGTGCGAACGCTCCGCGATCAAGGCGTTTTGAAAACA 

GGCAGATTCCACTGCCATCGAAGAAGAGAATCTTGATCGTGGTCCGCCGCTTGCCGACGAACG 

CGAACAGCGCTCCGCAGCGAGCCTCGTACCCCACACGCTCACGGATGAGACCCGAAAGCCGCT 

CGAAGCCGTAGCGCATGTCCACCGGCTCCAGCGCGACGAACACCTGCACGCCCGCCGGAATCA 

TCGCCCCGCTCCGCCGAGGGCACGGACCACCTCCGCCAGCAGCGCGGGGTCGAACCCCGCGGC 

GACGCGCACCCGCGCGCCGCCGACCTCGACGACGAGCTCCGCAGCGCTGCTCGTCACGGCGGG 

CGCCTTCGGCACCAGGCGCAGAAAGCGCGGTGGCTCGGCCCGCGACAGCCGGCTCGACCAGCC 

GTGCAGCGTCGAGGCCGCAAATCCGCGGCTCCGAGCGAACTCCTCCGCCGTTTCACCACTCTC 

GCGCCACGCCCGAACGCGCTCGGACCACATCACTTCGGTCGCCTTCGTCCTTGTCATGCACGC 

CATCATGAACTGGACAGCGCAGCCGGGGTGAGACGGCGCTTCGCGCAGCGCTTACGCAGAAGG 

CGCGCCGCGCGCCATTGTCGGATGCGGTGCGCGACTTCGCCGCCGATCGGCTGTTGCTGGAAC 

TGGGACAACCACTGGACGTAACGGCTGAAGCGAGCCAACGGCTCCAGCTCGCGCGGGGCGACC 

TGTTCGGCGCCTACCAAGCGTTGGCCCAGCTCTGGATCTGCGGCGCCCTGGCCGAACCGCCGC 

GACTGTATCCCGACGAACACCGCCGGCGCGTGCCGCTGCCGAGCTACCCCTTCGAGGGAAAGC 

GGTTCTGGATCGAGGGCTCGCCGTTCGAAACCGCGCCCGCCGCCGGCGCCTCACCCCAACCCG 

CCGATTCGGGGGACATTCTCAAGGGCGACCCGGCGGACTGGTACTATCGGCCGCGTTTCGAAG 

CGGCGCCGCTCTTGCCCAGCCCGTTCGAGAGCGAACCCGGCGATTGGCTGGTGTTCGAAGATG 

AGCTGGGGCTCGGCGCCTGGCTGAGCGAGACCTTGCGCGACAAGGGCGCGCGGGTCGCGACAG 

TCGTTCGAGGCACCGAGTTCCGACGCCTGGCGTCACAGCGCTTCCAGCTTCGTCCCGATCGAC 

GGGACGATTACCGGACCCTGCTGCACGAGTTGAAGGCGCAGGGCATCGCGCCGGTCCACCTGT 

GCCACCTATGGAGCGTGACCGCCGCACCGGATGCCGAGCAGTTGCTCGACGTCAGCTTTCACA 

GCCTGGTCCATTTGGCGGCCGCTTTGGGTTCGGTTGGCTACTTCCACGCCATGAAGTTGAACG 

TGGTCGCCAACCGGCTATTCGACCCCGAGTCGCCCGAGCGCACCGAGCCCGCCAAGAGTCTGT 
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TGCTCGCGGTGACCAAAGTCCTGCC3CAAGAGGTGCCCAACGTTCGAACCCGCGCCATCAGCG 

TGGACCTGGATCGCTCGTTCGACGCGGCGGCGCCCGCCTGGGCCGCCAGTTTGTTGGTTGAAT 

GCGGCGCGCCCGTCGAGGAAACGGTGGTGACCTACCATGGCGCAGCCCGATGGCTGCGCCGCT 

TCGATCGCGTTGCGGTGAATGGTCTCGGCCCGTTCCACCCCGATCAACCTGCGCCGCTGCTGC 

GCGAGCGCGGCGTGTACCTGATCACCGGCGGCCTGGGCGGCGTGGCTGGCCAGTTGGCGCGCT 

ACCTGGCGCGGGCCTGCCGGGCGCGGTTGGTGCTCACCGCGCGCCGGCCCCTGCCCGAGCGCG 

ACCAGTGGGATCGGGAGTCGGCCGTGCTGTCATGGGACGACAAGACGCGCCAGCGCATCGAGC 

TGGTGCGCGAGCTGGAGCGGCTGGGGGCCGAAGTATTGGTGGTGGCTGCCGATGTCGCCGACG 

AAGCGGCCATGGCGCAGGCGATCGAGGCCTCACTGGCGCGATTCGACGCTTTGGACGGCTTGA 

TCCACGGCGCCGGGATCGTGCGGG7CGCGTCGGGCCGCACGCCGATCGGGAGTATGACGCGGG 

CCATGTGCGAGGAGCAGCTCCGCC C CAAGATGTTGGGCCTCGACGTCGTCGACCGCCTCCTGC 

GCGATCGCCGGTTGGACTTCCGCATTGCCATCTCGTCGCTCGCCCCGATTCTCGGCGGCCTCG 

GCCACGTCGCCTACGCCGCCGCCAACCTCTACATGGACGCGTTCGCGACGCGCGCCGCCGCCG 

GCAACGCGCCTTGGATCGCGCTGAACCTGGCCGAGTGGGAATACGAGGGCCCGGCTACCTACG 

ACGAGCGGGTGGGCCGTTCGCTCAAGCAGCTCGAGCTCACCAACGAGGAGGGTATCCGCGTCT 

TCCAGACGGTGTTGGCCTTGGCCGC3CGCGGCCCGCTACAGCAGATCATTATTTCCACCGGCG 

ACCTCCAGGCCCGCCTCGACAAATGGATTCACATCAAATCCCTGCATCGCCGACCGGGGCCGG 

TCCAGCTCAGTCGCCGGACCGCGGCACCCCAGGGCGGTTTCGGCTCGGAGCGCGCCGCCTTCG 

AGGCCGCCTTCGCTGACGCCTGGTGCGACTTCTTCGGGGTTGAAGAGGTCGACCCGAACAAAA- 

ACTTCTTCGATCTGGGCGCCAGCTC3CTCGACTTCATCCACCTCGTCAGTCGCTTCAGCAAGG 

CCATCGAACAGCATGTACCGCTCGAGGCCCTGCTCGAACACTCCACCCTGCACGACCTCGCCG 

CCCACCTCGCGGGCGACGCGAACACCGACGCCAGCGACGAAGCGCGCATTCGCCAACGGCTGC 

AAGGCGCCAAGTCCGGCGACATCGCCATCATCGGCATGGCCGGCCGCTTCCCGCTCGCGCCCG 

ACCTGGACACCTATTGGCGCAACCTGGTCGGAGGCATCGACGCGGTCAGCTTCTTCAGCGCCG 

AGGAGTTGCGTGCTGCTGGCGTCACCGCGGCCGAGATCCACCACACCAACTACGTGCCGGCCA 

AGGGGCGCTGCGCCGACCAGGACTTGTTCGATGCGGCCTTCTTCGAATACACTGCCAGCGACG 

CCGAGCTGATGGACCCGCAAAATC3CGTGTTACACGAGGTCGTGTGGCACGCGCTGGAAGACG 

CCTGTTTCGACTTCAACGGCGATCACGGCCAGGTCGGCCTGTTCGCGGGCGCCTCGCCGAACC 

TGTGGTGGCAGTTCGTGGCCAGCTTTTCCGAGGCCGCCAAGACGCAGGGCATGTTCACCACCA 

CCCTGCTCAACGACAAGGACTCGA7CGCGACCCAGATTTCATACAAGCTCGGTCTAAAGGGCC 
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CGATCTGGTCCGGTCAATCGGACA7GGCCGTGGCCGGCGCGGTCTCGCTGACTCTCCCCGATA 

AGGCCGGCTACATCTACGAAAAGGGCATGCTCTTCTCGGCCGACGGCCATTGCCGGGCTTTCG 

ACGCCAACGCCACCGGCATGGTCTTCGGCGACGGCGCCGGCGCGATCGTGCTCAAGCCGTTGG 

ACGCGGCCCTGCGCGACGGCGACCC3ATCCATGCGGTGATCAAGGGCTGCGCCACCAACAACG 

ACGGCGACCGCAAAGCCGGCTACACC-AGCGTCAGCGCCCAAGGCCAGGCCGAGGTGATCCGCT 

CGGCCCAGATCCTGGCCGACGTGGC3CCCGAATCCATCAGCTACGTGGAAGCCCACGGTACCG 

GCACCAAGTTGGGCGACTCGATCGAGATCAAGGCGTTGAAGCAAGCCTTCGCCAGCGACAAGA 

ACGGATTTTGCGGCATCGGGTCGGrrAAGACCAACCTCGGTCACCTGATGGCGGCGGCGGGGA 

TGGCCGGCCTGATCAAGACGGTTCT3GCGATGAAGCACCGCC:AATTGCCGCCATCGCTGCACT 

GCGACGAAGTGAACCCCGACCTGGAGTTGGAGCGCAGTCCGTTCTACATCAACACCCGCCTGC 

GCGACTGGGTTGCACCGGGCGGGCCr-CTGCGGGCCGGCGTGAGTTCGTTCGGGATCGGCGGAA 

CCAACGCTCACGTCATCCTGGAGGA3CCGCCGACGCGCGAGAGCGGCACGCGCATGCGCCACT 

GGAAATTATTGATGCTGTCGGCGGCCAGCGAGGCGGCGCTCGACCGCCAGGCCGATAACCTGG 

CCGACTACCTGGAGCGCCATCCCGAGGCCCACCTCAGCGACGTGGCCTATTCCCTCCAGACCG 

GCCGGCGCGTTCTGGCCTGGCGGCGCACGGTCCTATGCGAGTACCGCGAGGACGCGGTGACCA 

GTCTGCGCGAGCGACAGGCCAAGCC-CGTCCAGACAAGTCGCGTCCGCTGGGACCACAAGGACG 

TGGTCTTCATGTTTCCCGGTCAGGGCGCCCAGTACCTCAACATGGGCCGCGACTTATACGTCA 

TGGA3CCGGTCTTCCGCGAGGTCATGGACCGCTGCTTCGAGTTGCTGGCCCCTTTGTGGTCCG 

AGCATCCGCGCCAGATCCTTTATCCGGAGGGCGGGGTGTCGACCCTGCTCCACCGGACTGATT 

ACACCCAGCCGATCGTGTTCTGCTrCGAGTACGCCCTCGCCCATTTGCTGCTCTCCTGGGGAT 

TGAA3CCGGCCGCGACCATCGGCTACAGCTTCGGCGAGTACGTTTCTGCCTGCCTCGCCGGCG 

TCTTCTCCCTGGAAGATGCGATCCGTCTGGTGACCGAGCGCGGTCGGCTGATGGCGGCTTTGC 

CCGC-3GGCGCCATGCTCAGCGTCCC3GTTCCCGAATGCGAGCTGCTGCGGCTGCTGGACGGCT 

TCCACGCCCAATCC-GCGGCCCATC73GCGCTGGCCGTCGACAATGGCGCCTCCTGCATTGTGG 

CCGGCGAGCAGGCCGCCATCTCGG3CTTCGAATCGATGCTTCGCAAGAAGCGTCTGTTGACCA 

TGCGGGTCGCGGTCAGGCACGCCGCrCATTCGCAGGTCATGACCGGCGCGACCGACGCCCTGC 

GCAG 3ATCCTGCGGAAGATCCCCCT 3TCCGCGCCGACAATTCCCTTCATTTCCTGCGTCACCG 

GCAC3TGGATCACTGCACAGCAGG3TACGGATCGCGAGTATTGGGTGAACCACATGTGCGGGA 

CGG7 3CGGTTCGCGGCGGGTCTG A Z 3GAGCTGGGTCAAAACCGCGAGGCGGTGTTCCTGGAAG 
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TAGGTCCGGGCCGCGACTTGACGTTGCTGGCCCACCGCATCCTGGCCGACAGCGCGGCCGTGT 

TCGAGCTGGTCAAGGCGCCCGACGGCGGCGACGACGATGGGTTCCTCCTGCTGGATCGATTGG 

CCAAGCTCTGGAGGCTGGGGATTTCGATTGACTGGGCCGGCTTCTACGCGGATGAGCGGCGGC 

GGAAACTCTCGCTGCCGGGATATCCGTTCGAGCGGCGGCGCTTCTGGATCGAGGGCAACCCGC 

TGGAGATCGCCGCCGGCAGGCCCAATGTCCAGGGGCCGCTGGTCAAGGCGTCGGACATCGGCG 

CTTGGTTCTACGTGCCGCAATGGCGGCGGTCGGTGCTCGCCGAGCCGGGTACAACGGCGGCGG 

GCGCCGCCGTCACGGCGGAGCAGGCACGCGTCGTGACCGAGCTACGGGCGGGATGCGCGTCGG 

CCGGCTTGGGCAGCGGGGCCTGCGGACTGAATGGCGGTGCCCCGTCCGAGCGTCCGAAAGAAA 

GTGTAGCGCCAGCCGGGTCGACCAGCGCAGCGGCGCAGACCGGCGCGGACTGCCCGACACCGA 

CTGGGGAGCCAGCGGCTGTGCCAAAGGACGGGGCCGAGCCGCGGCCGACCTGGCTTATTTTCG 

CCGACGCCGGCGGATTGGCCGAATCTTTCGCCAAGCGGGTTCAGGCCCGCGGCGAGAAGCTTT 

ACCTGGTGGCTTCCGGCTCGCGCTTCGAGCGCCTGGCCGAGACCCGCTTCCGCCTCGATCCCG 

GGGCCAAGTCCGATCACCGCCTGCTTTTCAAGGCGCTCGACGAGGCCGACATCCTGCCGACCC 

ACCTCCTCGACTTCCGCTCGCTTGACTGCGGCGGGCCCGACGCCGACCCCATGGACCAGGCCG 

GCTTCTTCGGGCTGTTGCACCTGGTCCAGGCGATGGCAGAGGCCGGCTACAGCCATCCCATTC 

GGCTGCTGATCGTCAG7TGCGGCGTCTACGATGTCACCGGTGCCGAACCGCTGCAGCCGGCGC 

GGGCCACGATGATCGGACCGGCTCTGTGCATCCCGCAACAGTATCCGCACCTCGAAACGAGCC 

ATGTGGATTTGGGCGTGGTCCATGCCGACGAGCTCCACGCCGCGCGCCAGCTCGACAGCCTAC 

TTGCCGAATGCCTAAGTGCAACGGCCGAGCGCCAATTGGCGCTGCGCGGCCGACACCGCTGGC 

TGCTGGACTACGAGCCAGTCCGCTTGCCGCCGCTCGACCCGGGCCGTCTGCCCTGGCGCCAGC 

GCGGGGTCTACTTGATCACCGGCGGTTTGGGCGGGATCGGCCGCATCCTGGCCGAACACCTGG 

CCCGCACGACCTCGGCTCGCCTGGTCCTAATCGGCCGCGAAACCCTGCCCGACCGCGACGACT 

GGGACGCCTGGCTGAACCGCCCGCAACCGGTCGACGCCACCCACGAACGGCTGCTGCACAAGA 

TCCGCGCGATTCGCGATCTGGAAGCGCTAGGCGCCGAAGTCCTGGTCCTCGCCGCCGACGTCG 

CCAACGAAGCCGCCATGCGCGAGGCCTACGATCGCGCCGAATCCCACTTCGGCACAATCCACG 

GGG7GATTCACGGCGCCGGCCTGATGGACGCGCAAAGCTTCTCACTGATCGACGCCCTCGACC 

ACGACCTCTGCGCCCGCCAGTTCGAAGCAAAAATCCGCGGCGTCTGCGTGCTCGACCGCGTTC 

TGGCCGACCGCACGCTCGACTTCTGTCTGCTGATGTCTTCCATCTCCACCGTGCTCGGCGGCC 

TGGGCTATTTCGGTTACGCCGCGGCCAACGCCTTCCTCGACGCCTTCGCCCAGGCGCGCAGCC 

GCGACGCCGCTTTCCCCTGGCTTAGCGTGGCCTGGAGCGATTGGAAGTACTGGACCGAGCGCA 
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AGATGGACAACGAGGTCGGCGCCGTCATCGACAGCCTCTCGATGGAACCCGCCGAGGGCTTCG 

AAGCCGTCACCCGCGTCTTGGCTTGGGGCAAGGCGCCCCACATCGCCAACTCGCCCGGTGACC 

TCGGTCGCCGCCGGGATCAATGGGTCAAACTGGCCAGCCTGAAATCGGCGCACTCCAGCGAGC 

CCGAGCCGGCTAGGCATGGACGTCCGGCGCTCTCCAGCGAATGGGTCGCGCCGCGCAACGTGG 

TCGAAGAGAAGCTGGTCGCCATTTTCGAGCAGGTGTTCGGCACTGCGGCACTGGGCATCGAGG 

ACAACTTCTTTGAGTTGCGCGGCGACTCGCTCAAGGCGGTCATGACCGCGGCCCGTATTCAAA 

AGGAGCTGAACGTGGAAGTGCCGCTGCCGACCTTCTTCCAGATGCCCACGGTCGCTGGCCTGG 

CCCAGTTCGTGACGCAAGCCAAGCGCAGCGGCCGGGAGACGATTCGGCGCACCGCGCCGCGCC 

CACATTACCCGCTCTCGGCTGCCCAGGGCCGCCATTACCTGCACTACCGCATGGACCCGCGTT 

GTACCGCATACAACGATCCCTTCGCCAACCTGATCGAGGGTCCGCTGGACGTGGATCGCGTGG 

AGCGCATCCTGCACACCCTCATCCTACGCCACGACTGCTTCCGCACCTCGTTCCACTTCCGCG 

AGGGCGAGCCGGTCCAGGTGATTCACGATCGGGTGGACTTCAACCTGGCGCGGATTACCTGCG 

CGCCCGAC-GATTTGCCCGAACGGATGCGCGATTTCATCCGCTCCTTCGATCTGGAGCGACCGC 

CCGCCATGCGCGCCGGCCTCTTCGTCACGC^CCCGAGCGCCACGTGCTGCTAATCGATTTTC 

ACCACATTATCACCGATGGCGTGTCGTTCGAGAACTTCGTCGGCGAGTTCGCGGCGCTCTACC 

GCGGCGAGATCCTGCCCGAGCTGGAACTCGAGTACAAGGATTTCGCGGTGTGGCAGCATGAGA 

ACCGGGGCCGCCGCGCCAACAGCGACCAGGCCCGCTACTGGACCGAGCAGTTGGCCAATGCGC 

CCGGGCCGATCGAGCTAACCACCGATTTCCCCCGTCCCAGTCGACGCAGCTTCCGCGGCGACC 

GCGTGCGGACCGTGCTTGATGCGGAGCTCGTTGCTCGACTCAAAGAGCACGCGGCGCGCCTCG 

GCATCACCCTCTATAGCCTGCTGCTGGGCGGATTCTCGTTATTGCAGCACAAGCTCTCCGACT 

CGCACGACATCGTCATCGGTTCGCCCGTCGCGGGCCGCACCCGGAGCGAACTCCAGGATCTGC 

TGGGCGCGTTCGTCAACACCCTGCCGATGCGCCACCGCATCGACCCGACCCATACCGCACGGG 

TCTTCTTGGAGCAGGTCCACCAGACAACCTTGGCGGCCCTCAGCTACCAGGAGCACCCTTTTG 

ACGAAATGGTGGCGACGCTCGGGTTCGCCGCCGATCCGGCTCGCAACCCGATCTTCGACACGA 

TGTTCTTGCTGCAGAACATGGCCATGGGTGCAACCACCATTCCCC<3TCTGCGGCTCTCGCCTC 

ACGACACTTTTCACCGCAAGGCATTGTGCGACCTGATGCTACAGGCGACCGAGTATGACTGCC 

ACCTGGAGCTGGTGCTCGAGTTCGCCACCGACCTGTTCCGGCTGGAAACCGCGCAAGTCTTGC 

TCGACCGCTACCGCCAAGTCTTGGAGTGGCTGTTGGCGTACCCCCATGAATCGATAGACGATT 

TGACGCTCGCCGGCCACTTTCGCGAAGTCGAAGTGACGATGTCGGACGAGGGCGACTTTGATT 

TCTCAGATT.TCGAACCCCGCAACGTGAGAAACCTATGGCGCGCCTGAGCCGCACAGATCTCCA 
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ACTCGCCATTCACCAGCGCACCGTGGAGCGCGAATATTGGCGCGCTCTGTTCGAGCGCCATCC 

GCAACGGTCCAGTTTGCCGGGGGT3CTCACCGCCCCGATCGGCGACGAGTCGACCCGCGAGAC 

CTTGTCATTCGTCCTCGACGAAGATCCCCTTCGGCTGAGTAATCGTTCGCCGCAACGCCTGCT 

CACGGTGTTGGCGGCTGGCCTCGCGGCTTTCCTCCACCGCTGCGACGGCGCTGAGCGCTTCAC 

CCTGGGGTTGGCCCTACCGCGCCA.a.GCCGATGACCATCACCCGATCCTCAACAGCTTGATCGC 

GCTGGGGGTCGCGGTCGACTCGAGrACGACCTTCCGCGATCTGCTCTATGCGCTTCGATCCGA 

ATACCACGAGGCGATGCGCCACGCCAACTTTCCGCTGGCGACCTGGTGGCGCGGCCTACCCGG 

CGGAACGGCGCCGTTCGACGTCGCCCTCAGCCTGGACCCCTTCACAGACGGCGATTCGCTGGA 

AGACCACGCGATCGGCGCGTTGTTCCGGTTCGCATTGGAGGGTGAGCGCCTCACCTGCCGATT 

GCGATTCGACCCTGCGCGCTATGACCGTCCCGCGATCGAAAACCTCGCCGATCGTTTCGCCCG 

CTTCCTCACGCGCCTGTGCCGGGAC3CCTCCACCGTCATCCAGGCGCTGGACCTTTCGCTGCC 

AAGCGATGAATCGGTGTGGCGCGTCACTGAAGGCGTGCGGCGCGGCTATTCGCAAGACCTGAC 

GCTAGACCGCGCGTTCCGCCGCCAGGCCGCGCAAACGCCCGATCAGCCGGCGATGACGTTGAA 

CGGGGACGTCCAGAGCTACGCCGAGGTCGACCGCCGCAGCGACGCGCTGGCCCGCCACCTCCG 

TCGCCACGGCGTCGGTCCGGAAACGATTGTGGCCGTCAACGCCCGGCGCGGGCCTAATCAGCT 

GACGGCCCTGCTCGCGGTCCATAAGGCCGGCGGCGCCTACCTGCCGATCGATGCCGAGGAGCC 

GGCTGCCCGCCAGCAATTCAAGGT3CGCGACAGCGGGGCGCGGTTGGCACTGGAGCCGTCGCC 

GGACCAGGCGCTGACCGTCACCGACCTGCCGCGGCTCTTCCTGGACGATGCCTCGCTCTTCGC 

TGACGGCGGGCTCGATGTGCCGCGCGGCGCCGACTCGCTCAATCCGGCCTATGTGATGTACAC 

GTCCGGCTCGACCGGACAGCCCAAGGGTGTGGTGGTTCCCCACCGCGGCGTGGTCAATCGTTT 

GAATTGGGGGCAGTCCCGTTTCCCGCTGGACGAACGCGACCGAATCCTCCAAAAGACGCCGCT 

GCTGTTCGACGTGTCGGTCTACGAGCTGTTCTGGGGCGCATGGAGCGGGGCCACCCTGGACAT 

CCTCGAGCCCGGCGCCGAGCGCGACCCCGACGCAGTGGCCAGGGCCCTGGCCGAGCGCGCCAT 

TACCGTATGCCATTTCGTGCCTTCGATGCTGCTCGTCTACTTGGAAGTCATGCGGCGGCACCA 

TGCGCCGCCCGTGCCGGACCGGCTCCGTTACGTCTTCGTCAGTGGCGAGGCCCTCGAACCGGA 

CCACCTCGCCGGGCTCCAGCAGATTGGTCGGCGCCTCGGCCGCACGATTCCCCTCGTTAATCT 

GTATGGACCAACCGAGGCCTCGATC3AAGTCTCCTGCTTCGCCTGTCCCGCCGACCATGTGCC 

GCGCCGGATCCCCATCGGGCAGCC3ATCGACAACGTCGCACTGCACGTTCTCGACCGGCGCGG 

CCGTCGCCAGCCGCCCTATCTTCCT3GCGAGCTGTTCCTGGCCGGCGACTGCCTGGCGCGCGG 

CTACCTCAACCGTCCCGACCTGAC33CGCTCCACTTCGTGCCCAATCCCTTCGGCAACGGCGA 
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GCGCATGTACCACAGCGGCGACTTGGCGCTCGTGCGCGGCGACGGCCAAGTGGCGTTTCTCGG 
CCGCCGTGACCACCAAATCAAAATCCGTGGTCAACGGGTCGAACTGGGCGAAATCGAGAGTCA 
TTTGCGCGGGCTCGAAGGCATCGCCGCCGCCGTCGTCCAGGCCGAGTCGCAGCACCATGAAAC 
CCTGCTGCACGCCTACGTCGTCACCAACGACGCGGGCCTCAATGCGGCCCGGCTGCGCGCCGC 
CCTCGCTCAACATCTGCCCGAGTACATGATTCCCCAGCGCTTCTCGCGGCTGGCCGAGTTGCC 
■GCTGCTGGCGGCAGGCAAGATCGACCGCGCCGCCCTCGCGCAACGTGCAACGCCGCTCGCCAG 
CGGCGCGCCCTTCGTGGAACCCAGCGGGCCCACCCAGCAGCGTATCGCAGAACTGTGGCGCCA 
GGTCTTAGCGGTCGCCGAAGTCGGCGCCGAGGATCCCTTCTTCAGCATCGGCGGCAACTCGCT 
CAATGTGCTCAAGCTCAGCGCCGCGCTGAGCGACGCCTTCGCGCGTGACATTCCCATGCCGGC 
CCTGTTCCAATACGACACCATCGCC3CCCAGGCCTCCTGGCTCGACGGGCAGGTTGACGAACG 
GGCCCAATCCGCCGCGCTCGACCGGCAGGCCGCCGAGGCGGCGCTGACCCTTCAAGAGACCGT 
GGCCATTTTTGAGGGATTCGATGACGAACCATGACCATCACGAGGAGAGCAGCGGCCTGGAGA 
TCGCCGTCATCAGCATGGCCTGCCGATTCCCGGGTGCTGCCGATTGCGACGCATTCTGGGAAA 
ACCTGATCAACGGGACCTCCTCGATCACCCATTTCAGCGACGACGAGCTGATCGCGGCCGGCG 
TTGACGCGCGCGACCTGACGCCGCAGTACGTGCGCGCGGCCGGCCAGATCGATGACGCCGAAC 
GGTTCGACGCGGCCTTCTTTGGGTACTCCCAGCGTGAGGCCGAGCTGATGGACCCCCAGTTCC 
GCCTGCTCCATGAATGCGCCTGGTCCTGTCTGGAACAGGCCGGCATCGATCCGCGCGTCGAAG 
CCGCGCCGATCGGGCTGTATGCCGGCGCAGCCGACAACACCTACTGGAACGCGCTCTCGTCGC 
TCGACCGGGGCTCGGCCGAATCGGAGCAATTCGCCGCCGAACAACTTTGCAACCGCGATTTTC 
TGTGCACGCTGGTCGCCGCCGCGCTCAACCTGAAAGGCCCCGCGGTGGTGGTTCAAAGCGCCT 
GTTCGACCTCGCTGTTGGCGGTCCACTCGGCCTGTCGTGCGCTCCTGACCGGCGAATGCCGAG 
TGGCCTTGGCCGGTGGGGTGGCGCTGCGCTTCCCACGCCCGAGCGGTTATCGCTACGAACCTG 
GCATGATCTTCTCGCCCGACGGGGTGTGCCGGCCGTTCGACGCGGGCGCTAACGGGACGGTGC 
CCGGCGAAGGCGCGGGGCTGGTAGCGTTGAAGACGCTGAAACGTGCCCTCCAGGACGGCGACA 
CGATCCACGCCGTGATTCGCGCGACCGCGGCAAACAACGATGGTGCCCGCAAGACCGGGTTCA 

ccgcgcccagcgcccacggccaag::cgaagtcattcgcacggcgctgcgcctggcccgggtgc 

CGGCCGAATCGATCGACTACGTCGAGGCCCACGGAACCGGCACGCCGCTAGGCGACCCGATCG 
AGGTAGCCGGCTTGGTGGAGGCCTrCGCCAGCGAGAAGCGCGGCTATTGCCGGCTGGGCTCGG 
TCAJJ^TCCAACCTTGGTCATCTGGACACTGCTGCCC-GCATCGCCGGCCTGATCAAGACCGTGC 
TGGCGCTQGAGCACGCGCACATCCrCAAGTCCTGCCACGTCGCCACGCCCAACCCCGCGGCGC 
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GCCTACACAAGACGCCTTTCCGCATTGCCGCCGACGGGATGGCCTGGCCGCGGCGTATGGCGA 

CGCCGCGGCGGGCGGCGGTGAGTTCGTTCGGCATCGGCGGCACCAACGTCCACGCGATTTTGG 

AGGAGGCGCCGCCCCGCGCGCCCGAGCTGGCGGACGGGCGCAGTCAGGTGTTCGTCTTCTCCG 

CCAAGGACGAGGCGGCGCTGGACCGTGCCCTTGCCAACTATGGTGCGGCCTTGGAGAAGCGCG 

GCGACCTCGCGGCGGGCGCGGTGGCCTGGACGCTCCAAAACGGCCGGGCCGCATTCGAATGGC 

GAGCCAGCGCGGTGGCATCCGACCTCGACGAATTGGCGGGCGCATTGCGCGGCGAGCGGCCCG 

GCGCCGTCAAGAAAAACCGAATGGCGCGCGAGGATAAC-CCGGTGGCGTTCTTATGTTCGGGGC 

AGGGGAGCCAGTACCGTGGCATGGGCCACGACCTGTACCGCGAAGAGCCGCGTTTCCGGCACC 

ACCTCGACGCCTGCCTCGCCATCCTCGCCGAACACAAGCCCGAGATCGACTGGCTGGCGTTGC 

TGGGCTACCGCGACGAGGACGAGCCAACCGACCAGATCGGGACGTCCTCGCAGGGCCCGAGCC 

GGTCAGCCGCATCGAACCCAGCGGAGCTCCTCGACAGCACCGAATTCGCCCAACCTTTGCTTT 

TCTCCATGTCCTACGCGCTCGGTCGGCTGTGGCTCGACTGGGGCGTGCGACCCACGGCGATGA 

TCGGGCACAGCCTGGGCGAGTACAGTGCTGCATGTATTGCAGATTTCTATGCACTCGATCAGG 

TGCTGCCCTTCATTCTGACCCGCGGTCGAGTCATGGCGCAATTGCGGCGCGGCTCGATGTTGG 

CCGTCAGCGGTGACAGCGTTCTGATGCGCGAGCTGATCGCCGATGCGCTCGATTTGGCGGCGA 

TCAACGGCGCTGACCAATTTGTCTGGAGCGGGCCGAGCGAGGCTGTCCAAGCCGCGGGGGTCC 

GACTGCGCGGCGCCGGCCTGCGTGCCACCGAGCTGAACACCTCACACGCGTTCCATTCAGCCA 

TGATGGATCCCATTCTGGAGGAGCTAACGGTTGCCGGTTCGCGACTTCAGGTCGGTGTCGGGA 

CGATTCCGGTCGTTTCATGCGTTACCGGAACCTGGTTGACGGCGAAGCAGCTGGCCGATCCGC 

GCTACCACGCGCGTCACGCGCGCGAACCGGTGCGGTTCGCGGCGGGCCTAGCGACGCTGACAG 

GGGAGGAGCCGCCGCTGATGCTCGAAGTGGGGCCGGGCTCGACCCTGGCGGCTTTGGCCCGCG 

AGCATTCGAATGCCCGCCTCCCGGTCGTCACCAGCCTGCGCCACGCTCGCCAGGCGACGCCCG 

ATCGCCAATACCTGCTCGAAACGCTCGGCTGCCTTTGGCGACACGGGGTTTCCGTCGATTGGG 

GGGCCCATGCCGGACGTTCGCGACGCTTGGTTTCGCTGCCCGGCTATCCCTTTTCCGGCGCGG 

TGCGCCGCTTAGCCGGCGACCCCCTCCGCCTGCTGGCCGGAGCCCGCGCCGTCGCCGCCCCGT 

CGGGAACGCGCCAACTCAGCGCCGACGCGCGCGACCTCCCGAACACTCCGGAGCCGACATCCG 

GCGCCGTGTCGGCGATGAAAGCGCCAATCGCCGCCGCCGATCCCGGCCTCTATCGCCTCTCCT 

GGCGCCAGGCCGGAACGGCGCCGCTCGGTCCGCCCGATCTCGGTCCGCCCCGCGACTGGATCG 

TCTTCGCCTCTGATTCTCACCTGCTCCAGGCGCTCAGGGCCAATCTCGGGACGCGCGCTCAGC 

GGGTGACGCTGGTGACGCCGGGCCAGGAGTACGCAGCCGAGCCGTCCGGGTTTCGGCTGCGGC 
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CGGACCAGATCGACGATTACCGCGCCCTGTGGGCGGACTTGGCGCAAACCGGTATTGTGCCAC 

GATACATCGCGTTCCTCGCCCCGTTCATGTACCGGGCGCGCATGGCGGGCGATGCCTCGACCC 

TGGACGAAGTGCGCGAGGGCGGCTTCCTGCCCCTGACCCGCTTGATCCAGACTCGCCCGCCAG 

GCGGACCGAGCGGACTTCTAAGCCTCACGATCGTCACCCCGGCCGCCCTGGCGCTGGGCGACG 

AAGCGACGCGCCCGGAATGGGCAATCCTGCACGGGATGGTCGCCGGCTTAAGCCGCGATTATC 

CCGAATGGCGCTTCGTCTCGATCGACGGCGGCGACCCATCCCCGCATCGGTGCGAAGGTCTGG 

CCCGCTTGATCGCGCTTCATGCGGTCGACGAGGCTGGCCCGACCCGCTTGGCGCTGCGCGGCC 

TTCACGCTTGGGTTCCACAGTGCGAGCACGTTCAGCCGGCCACCATCCCTGGGGCGGGTATGT 

GGCGCGAGGGTGGTGTGTACATGATAACGGGCGGATTCGGCGGGATCGGTCTGGCGCTGGCCC 

GCGCCCTGGCTCGAGAAGCTCGCGCCAAGCTGATCCTGGTCGGCCGAAACCTGCCCACCGCGC 

CGATCGATCTCGAGGCTTGGGACGCGCCGCCGTTGATTCTCACCGCCGACGTCGCCGACGAAG 

AGGCCATGCGCCGCGTCTTCGATGCCGCGCACGCCCGGTTCGGCGCCATCGACGGCATTCTTC 

ACGCGGCCGGTGTCCCCGGTGGCAGCCTGTTCGCCAACCAATCGGACGCGGCCTTCGAAGACG 

TGCTGCACGCCAAGGTTCGCGGTACCCTCGTGCTGCAAGGCCTGAGGGCAATCGATGCGCCGC 

TGTTGCTGATGTCCTCGCTGGACGCCTGGCTTCCCGGTCCCGGTCAGACCGCCTATGCCGCCG 

CCAACGCCTTCCTCGACGCCTTCGCCAGTCTGCGCCGGCGAGAGGGAGAGCCGGTGTACAGCG 

TTGGCTGGGACAGTTGGTGCGAGGTGGGCATGGCTGCTCGGGTCGCTGCCCGATCGGCCGACG 

AACGCGGCCGCCTGGCGCGCGAGGGGATCAGCCCTCGCCAGGGTTGGCAGGCTTTGAGCCGGG 

CGCTCGCCCTCGACCCCCCCCACCrGATGATCTCGCGCACCGACCTGACCTCGCGCTGGCACA 

GTCGATCCAGCCCTACGCCGGTCGCCTCGAGCGAACCCGAGGTGGCGCTGCCGCGCTGGACCG 

CATCCGCCTGCCAAGCCGTCATCGAGCGTGTTTGGTGCGAGCACTTCGCCACCGCCGCCGTGC 

CTCCCGATGGCAACTTTTTCGAGCTCGGCGCCAGTTCCTTCGACATCGTCCAGCTCAGCGCTC 

GACTTCAACAACAGTTCGGCCGAGATGTCAGCCACACCGTGCTCTACAGTCATCCCACCGTCG 

CCTTGCTGGCCGGCTACTTCGCCAATGACCCGACGCCGTCCGGTGCTGCTGCCGACGAACGCG 

ACGAAGCGGTGCGTCGCGGCCGCGACCTCTTGAAGAGCCGCCGGCGAGGAGTATGACCGTGGA 

GCACGAAACCGGATTCGAAATCGCCGTCATCGGGCTGGCTTGCCGCGTTCCCGGCGCTGCCGA 

CGTGGCCGCCTTCTGGCGCAACCTGGTCGAGGCCAAGGAGAGCGTGCGCTTCTTCGAGGACCA 

CGAGCTGCGGGCCGCCGGCGTGCCCGAGGAGATCTTGCGCCTGCCCAACTACGTGAAGGCCAA 

GCCACTGCTCGCTGATGGCGAAGCTTTCGACGCGGACTTCTTCGGGTTCCATCCGCGCGAGGC 

CGCCTACCTGGACCCGCAAGTTCGGCTCCTGCACGAATGTTGTTGGACCGCGCTGGAGGATGC 
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CGGCTACGATCCCGCGCAGTACGCCTACCCGATCGGGTTGTTCGCGGGCGTCTCCAGCAATCT 
CTCGTTCCTGTTCGACCGCATCGATCCGCGCGACTCCCCCCTGCAGAAGCGCTATGTGGCCGA 
GCTGAACGCGGCCTCCTTCGCCACCCAGATCGCCTACCGGCTCGATCTGAAGGGGCCGGCCAT 
TTCGATTCAAACCGCCTGTTCGACGTCACTGGTGGCGATTCACCTGGCGGCGCAAAGCCTGAT 
CGGCGGCGAGTGCCACATGGCCTTG3CCGGCGGAGCGACCTTGGAGGTCCCCAAAAAGCCCGG 
CTATCTCTACCGCGAAGGCTACATCAACTCGCCGGACGGCCACTGCCGGGCCTTCGACGCCGA 
CGCGGCCGGCACCATCTTCGGCGACGGCGTCGGCATCGTCCTGCTCAAACGCTACCGCGACGC 
CCTACGCGACGGCGATCACGTGTAC3CAGTGATCAAAGGCTCGGCGATCAACAGTGACGGCCA 
TCGCAAGGTGTCCTACACGGCGCCG3GCAAGAGCGGTCAAGTGGCGGTGATCCGCGCTGCGCT 
GGCGGCGGCCCAGGTAGAGCCGCA^.^CCATTCGCTTCGTCGAGGCCCACGGGACCGGCACACT 
CGCCGGCGATCCGATCGAGGTAGAC-GCGTTGACC<3AGGTCTTTGCCGAAGCGGGTCGCGGTAC 
CTGCGCCCTGGGTTCGGTGAAGACCAACATCGGCCACTTGGATGTGGCGGCGGGCGTGGCCGG 
TTTCATCAAGGCGGTCTTGGCGCTCGAGCGGCGCGTCCTCCCGCCCAGCCTTCACTTCGTCCG 
GCCCAACCCGGCCATCGATTTCAACGGGCCCTTCTACGTTTGTCGCCAAATCGAGCGGTTGAC 
GGAGAACGGGCGGTTGCGGGCCGGGGTGAGTTCCTTTGGCATTGGCGGCACCAATGCCCACGT 
GATTCTGGAGGAAGCGCCGGCGCCGGAGGCGAGACTGCCGGCCGGGAGCCCGCCAGGCGCGAG 
TCCGTTCCTGTTCCCGCTATCGGCCAAGACGCCGGATGCGCTGGCAGGCCGTTGCCACGACCT 
TGCCGACCACCTGCGGGCGCACCCC3AGCTCCTCCTGGCCGATGTGGCCCTCACTCTGCAGAT 
^GGGGCGGGCGTCGTTCGCCTACCGCCATGTGGTCCAGGCTGCGACGGCGGAGGAGCTGATTCG 
CGGTCTGGGAGCGTTCCGACAGGAGrcCATCCGCAAGAGGCGGAATCGAGTACAATGGGTGTT 
GGCAGGCGAGGCGATGTCGCTTGAG3CCGGTTTGCGGCTGTACGCCGATTGGCCGGTCTATCG 
GGAGCGGGTCGACGTCTGTCTGGCGATCGTCGCCAAGCTGCGCCAAATCGACGGCCGGTCATT 
CCTACATGAGTGGATCGAGCGACCGCGCGAGGTTCCTGCCGAATGGTCGACGGCGCTGGCGTT 
CATGTTCCACTGCGCGCTGGCGCAA3CCCTGAGCCAGGCCGGCCTGCACCCGCAGCGCATGTG 
GAGCCGTGGGCTGGGCGGACAGGTCGGCGTGGTTTTGGCCGAATCCCTGTCGTTGGAACAAGC 
GCTGGCGCTGGTGTTGTGCCAGACACCGGTTCCCGGCGATGCCACACCTCAGCGCGAACGCTT 
GGTTCGGACACTGGAAGGCTGCCG3TTTCGTCCACCACGATTTTTGATTTCGGCAGACAGCTC 
GGGTCGACCCCTGGACCTCGCCGAJ^.TTCGCTCATGTCGATTTTTGGTGCGGTGGCCAAAGCGC 
CTCGCCCAATGAGGCGGAGCTGCGCrCATGGAGCGACGCCGCGCCCGAGCTGGTGACCTTGGC 
GATCGGCCCATCCTTTCTCGAGGC33CCTCCGGGACGGTGGGTCTGGCGATCGACCCCAAGCG 
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ACCGATGACCTGTGTTCA3CGCACGGTGGCCGCGTTGTGGGAATGGGGATGTGACGTGCGCTG 

GGCTGCGTTCACCTCGTCGACCGGGCGTCGGGTTCCCCTGCCTACCTATCCCTTCGTGCGGGT 

AATTCCCACGATCGGCGACCCCCTTCGCGGAGCAGGCGCGGAGGATGACTTGATTGCGGCGAG 

CGCTTCCGCGTCGGCCGGATCGCCGCCCGAGCCGTCGGCAAACTCGGCAGCGGAACGCCCACG 

CGCCCAGTCAAGCATCGCCTCGGCAACCACACCGGCTCCGTCTCATACGTCGGCCAGCGTGGC 

CGTGGCCACCATTCTCGAAACCGTCCGTGCCTATTTCGGGTTCGCCGCCGTGCGTTCCACCGA 

CGCCTTCTTCGAATTGGGCGCGTCCTCGCTGGATTTGGTCAACCTGGGCCAGCTCCTTTCCGA 

TCGTCTCGGCCGCGAGGTTCCGACCCTGCTCCTCTACGACCACCCAACACCGGACCAGTTGGC 

GCTGGCCCTGACATCCGCGGCGCTCAGCGCAGAGGCGCCGCCCTTAAGGGGCGGTCATCGCGC 

ATCGACTTCCGGCACAGCCGCGAGCTCGGCCGCCTCCACCGCACCGACGTTCCCGGGGGACGC 

TCACTCGCAGCCCAGCTTCGTTCGCGAGCAGGACATCGCCATCATCGGGATGGCCTTCCGGGG 

ACCGGGCGCCGACGACCTGGACGCGTTCTGGAACAACCTGGTCGAAGGGGTCGAGTCGATCAC 

CTTCTTCAGCGAGGACGAGCTGCTGGCGGCGGGCGTCCCCCGCGAACATCTGGCGTCGACGCG 

CTACGTGCGGGCCAAGGGGGAACTGACTGGGATGATGGATTTCGAACCGGAATTTTTCGGTTA 

TTCGGCGCGCGAGGCGGCGGTCATGGACCCGCAGTTCCGCGTGTTCCACGAATGCTCCTGGCA 

CGCACTGGAGCACGGCGGCTACGATCCGACCCGATGCGCGGCATCGATTGGCGTCTACGCCGG 

CGTGACCAACCACCTGCCTTGGCTGATGCGAACTTTGCCGCACCTGACCGAGGAGGAGCAATT 

CGGCGCGCTGCTCCTCACCGACCGCGAGTTTTTCGCACCGCTGCTCTCCTACAAGGTCGGCCT 

GCGCGGACCCGCTATTTCGCTGCAAACCGCCTGTTCGACGTCGTTGGTGGCGATCGGCACGGC 

CTGrCGCGAATTGCGCGCGGGTGCCTGTCAGATGGCCCTAGCGGGCGGCGTGACGGCCAGCAT 

CGA3CGCTGCGGCTACTTCCACCAAGAAGGCTACATCCTCTCGCCTGACGGCCACACGCGCAG 

CTTCGACGCGGCGGCCGCCGGCACGGTCTTCGGCGACGGAGTCGGCATGGTGCTGCTGAAGCC 

GCTGGCCCAAGCCTTGGCCGACGGCGACACGATCCACGCGGTGATCAAGGGAATCGGCATCAA 

CAACGACGGCGCGCGCAAGGTCGGCTTCACCGCACCTAGCCGGGCCGGTCAGACCGAGGCGAT 

TCG3GCCGCGCTGCGCGACGCCGGGGTGGCGTCGAACCGCGTCAGCTACGTGGAGGCGCATGG 

AACCGCGACCAGAATGGGCGACCCGATCGAGGTCGAGGCCTTGACCCAAGCCTTTCGCGCCGA 

AGCCGACGGTCCGCTTCCGCCCGGCTCCTGCCTACTCGGCTCGGTGAAGTCCAACGTGGGCCA 

CCTGAACGCCGCGGCCGGCGTGGCTGGTCTGGTAAAAACCGTGCTGGCGCTCCAACACCGCCG 

CCTGCCGACCAGCCTGTTCTACCAGTCGCCCAATCCACACATCGACTTTGCGGCGAGTCCGTT 

CCGCGTGAACGGCCAGACTTCGGATTGGGTCGCGCCAGAGGGGACGCGGTTGCTGGCGGGAGT 
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GAGTTCGTTCGGTATCGGGGGAACCAACGCCCACCTGATCGTCGAGGAGGCGCCGAAAGCGCT 

ACCGACGACAGCGGCACCTCTGTCGACGGAGCCGAATGACCTCGACGCGGGCGACGCCGACGG 

GCTAGTGCTGCCGATCTCGGCCCGCACGCCGACCGCCCTGGCGCACATCGCGACCAACCTCGC 

CAATCACCTGGAACGACATCCGACCATCGCCCTGGCCGACGTCGCCCTGACCCTTCAGCTGGG 

CCGTCGCCAATGGCCCCATCGCCACAGCCTGATCTGCCGGAATCGAACGGAGGCGATCAAGCT 

GCTGCGCGCCGTCGTCCACTCCGCGGAGGTGCCGCCAGCTCAGGCGCCGGTCTCGGATGCGCC 

GCGCTGTGTTTTTCTTTTTCCCGGCCAGGGCGCCCAATACCCGAGCATGGCCCGCGACCTGGT 

TCGAAACTGTCGCGACTTCGCCCTGCACCTGGACCCCTGCCTCGACCAGTTGGCCGAACTGCT 

TCCCGAAGATCCGCGTTGCATCCTGTTCGGCGATGGCCCCGCCGATCGGCTCGACCAGACGGC 

CTACACTCAGCCGCTGCTCTTCTCCGTGTCCTACGCCTTGGCGCGCTGGTTGGGCGATTTCGG. 

CATTCGCCCCGATGCGATGATCGGCCACAGCCTGGGCGAATACGTGGCGGCCTGCTTGGCCGG 

GCTTTTCTCGCTGAGCGATGCCCTGCTGCTGGTGAGTGAACGCGGCCGCCTGATGGGCTCGGC 

CGCGCGCGGAGCGATGCTGGCCGTCCCCTTGCCCGAATGGGAACTGGAGGAACGCCTGGAGCT 

TCTGGCCGACGACCGAATCAGCATCGCGGCGGTCAACACCGCCGAGAGCTGCGTCATCGCGGG 

ACCCAGCGAGGCGATCGAGCGCTGCGCCCAGCGCTGGGCCGCGCAAGGCCTGACCTGTACGCC 

GCTGCGCACGTCCCACGCCTTCCACTCCGCGATGATGGAGCCGATTGTCGAACCCTTCGGCCA 

TGTCTTGGCACGGGTCACCTTCGCGCCGCCGCGCGCGCGCTGGATCTCGAACCTCGACGGCAA 

GCCGATCGATTCCGCGGCGGTGATGCAGCCCGACTATTGGGTGCGCCACCTGCGCCAACCGGT 

CCGCTTTCACGAGGGACTCAGTCACCTGTTGGCCGAGGACACCCATGCTTGGGTCGAAGTGGG 

TCCCGGCCGAACCCTGTCCTCCTTCGTCCGCCGCCACCCGGCCTACCGTCACCAGCCAATCGT 

CAACCCCATGCGCCATGCAGTCGAGTCGACGGGCGACGTGCGCCGGTGGCGCCAAGCGCTGGG 

CGAACTATGGCGGGCCGGCATGCCGGTCGCCTGGGAGCGGCAGCGGCGCGGCCGGCATGCCGG 

ACGACGTGTGCCGCTGCCGGGCTACCCCTTCGAGCGGCGGCCCTTCGCGGCCCGAAGACCGGT 

GGAGCTGGCGCAGCCCGCGCCCAAGGCGGAGCTGGTGAAAAACCCCGATCCCGCGCGGTGGCT 

GTACCGCCGCGTCTGGCGCCCTGCCCAGGCTGCGGCCGGCGGACTGGCGGTGCAGGCGACCGT 

TCTGGTCTTCGGCGACGGGTCCGAGCTGTGCCGCGCGGCGGTCGCTCAGGTGCAGCGCCAGGG 

GCTGAAGTGCGTCTCGATCACCGCGGGCCGCCAATTCGCGCGGGAGAGCGACATGCGCTTCAC 

GCTTGACCCCGCTGATCCGCGCCAGCTCGACCAGCTCTTCGCGGCCCTCGATGGCTCAGGCTC 

GCGGCCGCGGTACGTCCTGCACCTGCTGACCCTGAACCCGCCCCCGGATGCCTCGGCGATCAT 

CGCTCACAGCTACTACAGCCCGATGGCCTTGGCTCATGCCTTGGGCGCCCACGAGATCGCGCC 
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TGTCTCGATCACCGTCGTCACCGCCGGGGTCGTCGCCGTCGCGGACGAAGCGATTCGCGAGCC 

GCTGCAGGCGCTGATCGTGGGCCCGTGCCTGGTCATCCCGCAGGAGTTTCCCGGGCTCAGCGT 

TCGGCTGCTGGACGTCAACGTCGACGATCCGGCACCGCGTCTGGCGGAGCGGCTCGTGGCCGA 

GCTCTCGGGCACGGATCACATGGTC-GCGCTGCGCGGCGGCGAGCGCCTAGTGGGCGATGTCGA 

TCAAGTCGATGGCCTCGGTGTGGGGATCGCCAAGGTGCCCTTGCGCCGCGAGGGCCACTACCT 

GATTCTCGGCGGCCTGGGCGATATCGGCTACCACTGTGCCCGCTATCTGGCCCAAACCTACCG 

CGCCAAGCTGACGCTGACCGCGCGTTCGTCACTCCCGCCGCGCGCGTCGTGGGAGCGAATGCT 

GCGCGAGGGAAACCTGGATTCCCGGCAGCGCACGCGCATCGAGCGCGTGTTGTCGCTAGAGGC 

GTGCGGGGCCGAAGTCCAGACGGCrCCGGTCGACTTGGGCGATCGCCATCGCTTGGCCGATGT 

GTTCCGCGAAGCACGGGGCCGATTCGGCGCCATCGCGGGCGTGATTCACTCGGCGGGGATTCC 

GGGACACGTCCACTCGATCGACGAGCTGGTGCGCGTCCGCGACGAAGCCCAATTCACCGCGAA 

GGTTCGAGGGCTGCACCACCTGGCCGAGGTCGTCGATCCGCTGAACCTCGACTTTTGTCTGCT 

GTTCTCCTCGCTCTCGACCGTCCTCGGCGGGCTCGGCTACGGCGCCTATGCAGCGGCCAACGC 

CTACATGGACAGCTTCGCCCGCCGCCACGATCGGCCGGACGAATGTCGTTGGATCGCGGTCAA 

CTGGGACGCCTGGCTGTTCGAAGCCAAGACGTCGTCGGTCGGCGCCGAATTGGCGCGCCTGGC 

GATCGTGCCCGAGGACGCTCCGGCCCTGTTCGCGCGGGTGCTAGAGCGACTTCCGCAATCGTT 

CATCGTGTCCACCGCCGACCTTCGGGCCCGCATCGACACTTGGATCCGGGACAAGAACCGCGT 

CCCGCCCGCCGAGATCCGAGCGGTTCAACCGCGACCGGACCTGAGCCAGGCGTACGCCCCGCC 

GATCGGCCCGCTGGAGATTCAACTCTGCGGGCTGGTCTCCGCCTATTGCCGGTTCGACCGGAT 

CGGGCGGGACGATTCCTTCTTCGAAATCGGCCTCAGCTCGTTCGACTTGATCCAGCTCAGCTC 

GCGCATTCACCGCATCACCGGCAAGGATCTCAATACGACCCAACTGTTCAGCTACCCCACCGT 

GCGCGCCTTGGCGCTCTTCCTCGGCGGCGAACCGGAGGGGCTCGCGGCGGAGGAGCCCGCCAT 

GGAGAACCTGTGGCTGCAACGAAGCGATGCGACCCTCGATGAGTGAGACCGAGGTCGCCGACT 

GCGGCGCTACCGACCGCGGTCGAGGATTTTCCGCGCAGCGATCCGGGACGACTCGCTGAAGAA 

GCGCGATAGAAGAACGGAATCGTGTATGAAATACGAAACCACCGGATTGGAATTGGCCGTCAT 

CGGTCTCGCTTGCCGCTTTCCAGGCTCACCCGATCCCGAACAGTTCTGGTCGAATCTGCGCGC 

AGGTCGCTCCGGAATCC<5CCAnTCAGCGATGCCGAGCTGAGCCACATCCCCGCATCCCTGCG 

TCACCATCCGCATTACGTCAAGGCCAAAGGCGCGCTGGACCACGCCGATTTCGAACCAGCCTT 

CTTCGGCTACTCGCCCAAAGAGGCCGAGGTGATGGACCCTCAATTCCGGCTGCTCCATGAGTG 

CTGCTGGGAGGCGCTGGAGTCAGGCGGCTATGCGCCGAGCCAATTCGCGGGTCGGATCGGCTT 
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GTTCGCGGCGGCGGCCTTCAACGACGGATGGATCGCCGGTACCCTCGACCGGCTGCGCACCGG 

CGTGGGTTTGAGCTCCCTGGAAACGGCGTTCTTGACCCTGCGCGATTACCTGACCACCCAGAT 

CTCCTATCGGCTCGATCTGCGGGGCCCCAGCCTGCTTGTCCAAACCGCCTGCTCGTCGTCGCT 

GGTGGCGGTCCAGCTCGCCCAGCAGGCGCTGATCTCCGGCGAATGCGCCCTGGCCTTGGCTGG 

CGGCGTGTGCGCGACCGATCCGCTGCATTCGGGATACCTCTATGAACCCGGCAACATCTACGC 

GCGCGACGGCGTCTGCCGACCGTTCGACGAGGCAGGCGCCGGTACGGTCTTCGGCGACGGGTG 

CGGCATGGTCCTGCTCAAGCGGCTGAGCGACGCCCAGCGCGACGGCGATACGATCTGGGCGGT 

CATTCGCGGGGCGGGCGTGAACAACGACGGGCACCACAAGGTTGGCTACACGGCTCCTGGCAC 

GAGGGGCCAGGTGGCTTTGCTTAAAAGTGTTTATCGCGCGAGCCGGGTCGACCCGGCGACGCT 

CGGCTACCTGGAGGCCCATGGCACCGGCACCGCGCTCGGCGATCCAATCGAGGTCGAGGCGCT 

TACCCAGGCCTTCGCCAGCAAACGTCGCGGCACCTGCGGCTTGGGCTCGGTCAAGGGCAACCT 

GGGTCACCTCAACACGGCGGCCGGCATCGCTGGACTGATCAAGGTGGTGCTGGCGCTGAAACA 

TCGCGAAGTGCCACCCACCCTCAATCTGCGCCGTCCCAATCCGAAAATCCGCTTGGACGAGAC 

GCCGTTTTTCCCAGTCGTCGAGTTGCAACCCTGGCCAAGCGGGACCGGCCCCTTGCGAGCCGG 

CGTGAGCTCCTTCGGCATCGGCGGTACGAACGCCCACGTCATCCTCGAGGAGGCACCGCCGAC 

GGCCAACCCGGCGCCACACGGCAGATTCCGACTGTTGCCGCTTTCGGCCAAGACACCGGCTGC 

GCTCGAAGCGAAGCGCCGCGATCTGGCCGGCTTCCTCGAACGCCACCCGGAGACCTCCTTGGC 

CGACCTCGCCTTTACCCTGCAACGCGGCCGCGAGGTCTTCAGTCACCGCGCCTGCCTCGCCGT 

GGAGACCTTAACGTCCGCGCGCACGCGGCTGAGCGGCGAGTCGTCGAGCACTTGCGTGGTGGG 

CCCCGCGCCCAGCGCCATATTTCTGTTCCCTGGTCAAGGCAGCCAGCTCGCCGGGATGGGCCG 

CGGTCTGTATCACCATTTCGAGCCGTTCCGCACGGCCGTCGATGCCTGTCTGCGCGAGCTGGA 

GCCAGGACTGCGGCAAGCGCTCAGCGCCCATTTCGATCCGAATCGCGGCGCGGACCCACCCGA 

TTCGACGACCTTCGTCCAACCCTTGTTGTTCCTCGTCGAGTACGGGGTGACCGAGTGGCTACG 

CTGCTTGGGTGTGCGGCCAACAATGGTGTTGGGTCACAGCTCTGGCGAGTATGCCGCAGCCTG 

CGTCGCGGGCGTTCTGTCGCCGTCCGCGGCGGTCTCGCTGCTGGCCGAGCGCGAGCGGCTGCT 

GCGCGACCTGCCAGCCGGCGCCATGCTCGGCGTCCCGCTGGCCGCCGAGGCGCTCGAGGCGAT 

GTTGCCCGACGCTCTCGATCTGGCGGCGATCAACGGCTGTCAGCTTTGCGCCGTGTCCGGGCC 

GGTCGCGGCGGTCCACGCCTTCAAGGCCCAACTGGAAGCCGCCGGACATCACGCCCGCCTGTT 

GCACACCGATCGCGCCTTCCACTCGCGGCTGGTAGCACCGGTGCTTGACCGGTTCCAGGCAGC 

CGTTCAACACGTGGAGCTGCGGCC-GCCGCAAGTACCTTACCTCTCGACCGTCAGCGGGCGATT 
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GGAGGCGGATGGGCCGGCGAACCCGCACTACTGGGTGCGTCACCTGCGCGACACGGTGCGGTT 

TGGTCCAGCCCTGGAGGCGCTGCCGCCGGTGGATTCCTTCGTGTGCATCGAGGTGGGACCAGG 

CTCGGCCTTGAGCACCATGGCGCGCGAAACGTTGGGTTCCCAGGCGCGACTGATTTCGTTGCT 

GCCGCGGCCGCGAACGGGGCAAATCGAGCCCGGTCCGGTATTCGAACGACTGGCGGCGCTTTG 

GCGCAGCGGGTTGACATTGGATTGGTCTAAATTGACGGGCGGCGAAGAGGGTCATCGAATTCC 

CTTGCCAGTCTACCCGTTTCAGCGCAGCCATCTGTCGAGCTCCCTGGCGGCGGGCCACACGCC 

TTCGTCGCGGCCTGCAGTCGAATCAGGCGCCATCCTTGCCGAGCGATCCGCAGGGGAAAACGC 

TGAAACCCGGGATTGCCCGCTGCCAACCGCCACGCTCGAGCCCAAGGCGGTCGCTCCGGCCCC 

ACTCGAGGCTACCGACGCCGCAGGTACTCGCGAGCGACTGGCCGAACTTTGGCGCGAGTTGCT 

AGGGTTGACCTCGATTGGGCCCGACGACCATTTCTTCGACCTGGGCGGCCACTCGCTGACCGC 

CACGCGGCTGCGCGCCCTGATTCACCAGCGGTTCGATGTCGATCTCGGGCTCGACGAAATCTT 

CGCTCATTCGCGTCTCTCCCAGCTGGCCGCCCGTATCGAGGCGGCGGCCAAGAGCCGATTTTC 

CTCCATTCCCAGCGCGCCGGACCAGGACGACTATCCCTTGTCATCCGCCCAGCAGCGGATTCA 

CAGCATCGTCACGAGGGCCGAGGTCGGCACTGCTTATAATTTTCCGATCGTCCTCGAGCTGCA 

GGGCGCTCTGGATCGAGTGCGATTCGAGGCGACGTTCGCGGCATTGTTCCGGCGTCATGAGGG 

GTTCCGCACCCGCTTTGTGATGCGCGATGGCGGGCCGCGCCAGCGCATTGTACCGGACGTGGC 

GT7TCGCCTGCCGCTCACCCAGGTCGAGCCAGAGCAGGTTCCCGGGCGCATCGAGGCCTTCAT 

CCGTCCCTTCGATTTGGAACGCGCGCCGCTGTTCCGCGCGGAGCTGTTGCAGTTGGCCGAGCA 

GCGCCATCTGCTACTTTTCGACATGCACAACTTAATTGCCGACGGTATCTCGCTCAACCTGTT 

CGTCGCCGATTTCGCGGCCCTGTACCATGGTCGTCCGCTGGCGCCGCTGAAACTCCGCTATCG 

CGACTATGCCGTTTGGCAAGAGGCGCGGCTGGCCTCCGATGACCTGCGCAGCCAGCGCGAATG 

GTGGCACCGGCGGCTTTCGCCGCCGGTCGCCACGCTGGCGCTCCCTCCCGATTTCCCGCGTCC 

GGCGGTGCGCCGCTACAAGGGCCGTAATGTGGTGTTCCACCTGGACCGGGAGATCCGCGACCG 

CCTGGTGGCCCTGGCTCGAACCCAGGGGGTCACCATGAACGTGATGATGCTGGCGCTCTGGGC 

TGCGCTGCTGCATCGCGAAACCGGCCAATCGGAGCTGGTGGTCGGATCGCTGCTCGGCGGGCG 

GCCGCACAGCGAGCTGCATCCCGTGATCGGGCTCTTCACCAACTTTTTGCCCTTGCGGTTGGC 

GGTCGAGGGATCGACCCGCTTCGATCGCTTCCTTGCCGCTTGCCACCAGGTGTTTCTCGAAGC 

CTATCAGCGCCAGGACTATCCGTTCCACTTGTTAGTCCAGGAACTCGTGCCGGTCAGGGACCC 

GTCGCGGTCGCCGCTGTTCCAGACCTCGCTCGTCTACCACAACGAAATTGACGGCAAGACCAA 

GCTGGAATTGGAAGGGCTGAAAGTCGAAGTGGTTCCCTTCGAAAAGGGTGTGGCGAGGCTGGA 
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TTTGAAGCTGGATGTGACACCTTTTTCCGACCGACTCGAATGTGTTTTGCAATACGACTTGGA 

TCTGTTCTGCGAGGAGACGATGCGCGGCCTGATCGCGCGGTTCCAGGCGTTGGTGGCGGGGCT 

TGTCGCCGATCCGGCGCAATCGCTCGCCGCCGCGAGCGTTTCCGGGAAGCGGGCGCTGCGCGC 

GGGCGTGGCCACGGCAAGCGAATCGTCGCCGCAGTCACTGCCGCCGCAACCATCGACGGCGTA 

CGCCACTCCCTCACCGCAGTCACCGTCGCCGGTAGTCCTGACGGGACCCGCCGACCTGCCCGC 

GATCTTGGCGGCCTACGTGGGGCAGAACCCCCATCCGTTCGCGATCCATCGGGGTCTCATTTT 

GGAGGCGCCGCTGGGGTTGCGAGCGCTGCGGTCGGCGCTGGACGCAGTGCTCGGAGAACACAC 

CCATTGGCGCAGCGTGCGTGCGGGCGATCGCGCGCGGCGCGTGGATAAGTTGGAATTGACCAG 

CCTGGTGCGGCTCGACGACCTGCGCGGGTTGGTCAATCCTCAGGCGAATGCCTTCACCCTGGC 

TTGGCGCGATCTGGCGATGCCGTTCGGGGAGGGGCGTCCCCTGTGGCGACTCCGCCTGGCGTG 

GTCGGCTCCATCGCGCTGG^rTGCTATTGCTGACGGTTCATCCATTGATCGGCGACAACGGCAC 

GGTCGACCTCTTTCTGGCGGCACTCGCCGATCACCTGCGCCGCGCGTCCGCTTTTCCCGTAGC 

ACCGCTCGATGAGGCCGAGCTGGAGGCGGAGCTGAAGTGGGGAGAGGAAGGGGAGGGCCTCGG 

GCTGACCGCGATCGCGCCGGTCCTGGGCCAATTGCGCGAAAGTCGGCTGAGTCCTGTGGCCCA 

GATGTGGCTGGACGAGGTCTGTCGCCGCCACGACCTCACCCCGCTAGAGGTCTTGGCGGCCCG 

GCTCCTCGATTGGACACGAAGCCACGGTCACGGGTCGATCGCTTTGTGGACGCCGCTGCCCGA 

GGACCATCCGCTTCGCGATGAAGGCCGCTGCCTCCAGGTTCGCCTGCTGGAGGGGCCGCCGTC 

GCAGCGAGGAGCGGGCGATCCT^GCTGGCTCGAGCAAATCGCCTTGAGACGGGGTACCCCTGC 

AACGGAGGTCGTTTGCCCTACTCCGACCCAACGGGCAGCCATCGACCTCGCGCTGGCCTGGCT 

GCCGCAGCCGCCTCTTCACGGTTTGGTCGGAACCGTTCAGCCGTGGCCGGAATCTCCATTGGT 

CTGTCCGTTTCCCCTCAATCTCGCGTTCCGGCCAAGCCATCCAATTGCCTACGCGCTCAAGCA 

CGAGGCCACGCTCGCGGTCACGGCACGGGCGCGCGATCTGATGCGTTTCCTCGACGGCTTGGG 

CCCGGAAAGCTGAAGATTAGCATAAGCGCCCGGCCAAGGGCATCCTAGGATGACGCAAGCCTC 

GGCCGCGTCGACGTCCCAGGTCGCGCCGGAGGTCACCCCCGGCCGAAAGGACGACGATGACGA 

TCAAATCCGAGATGTCGGCCGTTGCTCACTCTGCGGAGAGCGGCTTCCGCGCTGGGCCACGCG 

TGGGCGGCGCGATGAAGCGGGGCCGGACGCCGGAGCAGGCCGGCGTGAAGCTGCTCCGCGCCC 

CGGTGAAGCGGAAGTGGCTGCCCCCGGCGCCCGTCCTGCGCCTGAGCGAGCGGCGTATCCCGG 

AGGTGTGGGCAGGCTACCGCGCGAGCGCGGGATGACCCGAGCCCCGCCCGCCGGCGCGACCAT 

GACGCCGCCCCACGGGGCGAGTCGTCCGGCGCGCCGGCGCGCGTCGGGGCTTCCGCCGCCGGG 

CGGGCAGaXGCAGGATGGTCGGGCATGGTGACGCGTCCGACGTCCGACGGCATCGAGGACGAG 
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CTCGCGCCGTTCCCCCCGGTCCTGCGCGGCTGGCTCATCGAGGGCGAGCTCGGCCGCGGCGGG 

ATGGGGCGGGTGTTCCGGGCGCGGCACCCGAAGACGCGGGCGCGGGCGGCGATCAAGGTGCTG 

CTCGGCGACTACGCCCGCCGGCCGGACGTGGTGGCCCGCTTCCGGCAGGAGGCGATCGCCGTC 

AACATCATCAACCACCCGGGAATCGTCCGCGTCTTCGACTCCGGCGAGCTCGAGGACGGCTCG 

CCCTACATCGTGATGGAGTACCTGGACGGCCGGGGGCTGCGCGACTGGGTGCAGGCCGTGCCG 

CCCGCGGAGCGGCCGCGGCAGGTCGTGCGGCTCGGCTACCAGATCGCCTCGGCCATGGCCGCG 

GCGCACGCGTCCAAGGTCGTCCACCGCGATCTGAAGCCGGAGAACATCATGGTGGTCGAGGAC 

GAGCTCGCGCCCGGGGGCAGCCGCGTCAAGATCCTCGATTTCGGCATCGCGAAGGTCCTCTGG 

GGAGGTCTGCCCGAGGTGCTGGAGCTCGAGGGGAGAGGCTCCCTCGCGCCCGCGTCCGCGTCC 

ACGATCCGCACCGAGCTCTCGACGCGGCCGGCGCCGACGGTGGGCGCCACGACCGGCCCAGAG. 

AGCCCGCTGGGCGCGAGCGCCACGCCAGAGAGCGCCCTGGGCGCGAGCGCCACGCCAGAGAGC 

GCCCTGGGCGCGAGCGCCACGCCAGAGAGCGAGGCCCACGAGGAAGACGCGCTCCGGAGCCTC 

CCCGTCGTGACCAGCGGCAGGCCCGCGATCCACCCCGCGCCGGTCGAGATCCCGGCCGAGGCG 

GTCTCCTCCGCGGCGTCGCGCGGGTCGCGCGCGTCGATCGAGCCAGGCGCGCCCGCGCCGCAG 

AGCGAGGGCGCGGGACAGCCCACGATGCCGTTCACGCAAGAGGGCGTGTGGGGCCTCGGGACG 

AGGAGCTACATGGCGCCGGAGCAGGAGCGCCACTCCGGGAGCGTGGACGTGAAGGCGGATGTC 

TACTCGCTCGGCGTCATCCTCTATGAGCTGCTCGAGGGGCGGACGCCCGACGCGCCGAGCGCC 

GCGTGGCCGCCCCCGATGAGCGCCGCCACGCCGCCCGATCTCGTCGCCCTCGTCCACCGGGTT 

CTGGCGTTCGATCCCGATGCGCGGCCGCGCATGGCGGAGGTGGCGAGCGCGCTTCACCGGCTC 

GGCCGGGCGAAGAAGGAGCTCGACGAGGCGCTCTCGAGGTGGGTCGTCGGCGGAGGGGCGCCG 

GGGCTCTTGCCGTGCGGCTATGCTCTTCTCGAACTGGTCCTCCTGGGCCCTGGGAACTTATAC 

GATTCTTTCCAGCCTGTAAGTGCATTTTTCTTTCAATATCGTCCTCTCTTCATATACGAGGTG 

AGTTCTCTGAGGTCCTCCTATAAGTCTGGGGTGTCCTATTCGGCCTCTTACTTGTTACTTCGC 

CTTCTTAGGAGTTTTTCCTTAATTTTGCCCTCTTACATTCCCGTATTCATTCTAACTGGGCCC 

TATCTCATTCGCTAATACGTTTCTGTATTGTGTACATCTCCTATCATGTGTCAATACTTGTTT 

CTGTTTATCATTATTCTTATTGTTTACGCTCTTATTTCATTCATAGTATAACATTAGTTTA^ 

GATTATCGCACTTGAATTCGCG 



or its complementary strand, 
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10 



15 



20 



25 



(b) DNA- sequences which hybridise under stringent conditions 
to regions of DNA- sequences according to (a) encoding proteins 
or Co fragments of said sequences, 

5 (c) DNA-sequences which hybridise to the DNA-sequences accord- 
ing to (a) and (b) because of a degeneration of the genetic 
code» 

(d) allele variations and mutants resulting by substitution, 
10 insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isof unctional expression products. 

8. DNA sequence according to claim 6 selected from the fol- 
15 lowing 



30 




(a) 


open reading 


frames : 
















Nucleotide Pos-tion 










35 


20 


ORrl 


1666 - 


1 


Seq 


ID 


No 


3 




0RF2 


1605 - 


3338 


Seq 


ID 


No 


4 






ORF3 


6100 - 


3398 


Seq 


ID 


No 


5 






0RF4 


7110 - 


6374 


Seq 


ID 


No 


6 


40 




ORF5 


9590 - 


8433 


Seq 


ID 


No 


7 




25 


ORF6 


11393 - 


9855 


Seq 


ID 


No 


8 






■ ORF7 


13656 - 


12712 


Seq 


ID 


No 


9 






ORF8 


15374 -. 


18984 


Seq 


ID 


No 


10 


45 




ORFS 


20003 - 


27889 


Seq 


ID 


No 


11 






ORFIO 


28251 - 


29402 


Seq 


ID 


No 


12 




30 


ORFll 


31720 - 


30401 


Seq 


ID 


No 


13 



50 



55 
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0RF12 


31982 


- 32932 . 


3eq 


ID 


No 


14 


0RF13 


33128 


- 33613 


Seq 


ID 


No 


15 


0RF14 


33661 


- 34007 


Seq 


ID 


No 


16 


0RF15 


35611 


- 35255 


Seq 


ID 


No 


17 


0RF16 


37856 


• 35730 


Seq 


ID 


No 


18 



or DNA sequences complementary to said open reading frames, 

(b) DNA-seqiiences which hybridise under stringent conditions 
to regions of DNA sequences according to (a) encoding proteins 
or to fragments of said DNA sequences, 

(c) DNA-sequences which hybridise to the DNA-sequences accord- 
ing to (a) and (b) because of a degeneration of the genetic 
code, 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isofunctional expression products, 

and peptide sequences corresponding to said open reading frames 
SEQ ID No 19 (>0RF1) 

VDPEREAVTLGLAFNRAQGRTYARGPEARAEYIGTAMRAADVIEDRFEIERLAVSGGMGDVYR 
ARDRVSGQAVALKVLQGASANDLRRFAREAEALVTLRLPGWQYVAHGVTGAGRPYIJ^MEWL^ 
GVTLEERLAGAPLTIJ^SVALAARVATTLGAIHWLGVVHRDLKPSNLMLVGGAVERVTLLDFG 
lARHLRLAPTLTSPGAVLGTPGYIAPEQVRGDAPVDARDVFALGCVLFQCLAGRPPFLGNSAL 
ALI^RVVLEEPPRLGELRDGIPEPLERLVARMLAKNAGERPRDGAAAAAELAAVAGEGLSIGA 
SAVAAPAAPGEAI TTAERKVMCVI LAEDGGAEAGATLSEDDGAARAEALRDI AARHGGRLDRL 
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QARWWLVALSGAESPTDI^TRAAKCALALRA?U.C<3VPVSVATGLAEVEARLPVGELVDRVAQL 
lAGRDGT.SPPEIRLDDATASLLASRFETVQGPGGCWLRGPKEEPDAVPRLLGKPTPCVGRERE 
LSQLATEWRKCVDEPSANAWWGAPGLGKSRLAWEFLRTLEQREGAAI 

SEQ ID No 20 . (>ORF2) 

VRPCARLNASPSVTASRSGSTAAGSVKASTSACVEQPATGRTQPASPRWPPGAAALRLTSAMP 
RWFNTAGPCNPADHYMLPAEERLPAVRDLVDRKAYFVLHAPRQIGKTTSLRTLAQDLTAEGRY 
VAVLVSAEVGAPFSDDPGAAELAMLAEWRGTAGAQLPADLRPPPFPDAPAGQRIGAALRAWAQ 
AAPRPLWFLDEADALRDATLVSLLRQIRSGYPDRPRDFPHALALVGLRDVRDYKVASVDSGR 
LGTS S PFN I KVESLTLRNFTRDEVATLYAQHTAETGQVFRPDAVDRAFELTQGQPWLANALAR 
QLVEVLVKDRAQPITSANVDRAKEILIERQDTHLDSLVDRLREPRIRAVIEPMLAGTALPSVP 
PDDLRFAIDLGLVRMTAEGGLDVANPIYRSIIVRELAFPIRASLPQIKATWLTQDGRLDADRL 
LDAFLSFWRQHGEPLLGAAPYHEIAPHLVVMAFUiRVVNGGGTVEREYAIGRGRMDLCVRYAG 
ETLAI ELKVWRDGRPDPVAEGLAQLDEYLAGLGLDRGWLI LFDQRSGQPP I AERTRRERALS P 
AGREVAVIRA 

SEQ ID No 21 (>0RF3) 

VTIKKTFRSIDPATLPKHFDSPVAELRIiADLWEADGTYRYDPSRPREETFVVDTPPPTASGSL 
HIGHVFSYTHTDVWRQRRNRGFNIFYPMGWDDNGLPTERRVQNYFHVRTDVRTPYERGLTLP 

Oaapetikkepprivsrpnfielch:<vtredeqvfkal?rrvglsvdwrneyatiddhcrrta 

OLSFLDI^KGHLYSVFAPTMWDVDFQTAVAQAE\TDRPQSGAFHDIAFAVEGTAEELVIATT 

rpellaacvgvtahpedpryqhlfgktaltpifrapvpifpsplvdrekgtgilmvctfgdat 

DVIWWRHQKLPLRQMLGKNGRVLPVTFGEGAWESRDPAAANAAYAPLQGRGVKQARAAWELL 

rreehaaapgrgpalrgeprpieravkfyergdqplefvptrowfvrladkkaelleygdkik 

WHPDFMRLRYRNWTEGLQGDWCISRQRYFGVQFPVWYPLDAEGNPDHSRPLLATREMLPVDPT 

vdvppgyeasqrdqpggftaesdvfdtwftssltpqisshwgddparharlfpadlrpqahdi 
irtwafytiakammessvpwhhvaisgwildpdrkkmskskgk^tpmhlldtyssdato^ 

SASAiy^GTDTAFDEKVLKIGKRLVTKIWNASKY\TaSQSAEVHPISEELDRALLHKLSAVVDDA 

trsfdehefaaalertedffwrwftdaylelakarargeggageaargsavaalrlglsvllr 

LFAPVLPYI.TDEVWRWVYAEETGDTS I HRAKWPSAADFAAVAAPSDPGLLDLAAAAMAAVNKR 
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KSELGASVGRWTDIALGANAATIARLKPALGDVLTAVRAGAFJVLVRPEIADGEVLVVRCEL 
PAAAAAAGAGGAAASEE 

SEQ ID No 22 (>0RF4) 

MIHAEPFEARLVAARPLSPFVRELSFERADGRSFLFEAGQWVNLVLPLPGGEVKRAYSTASAP 
DGSPRFDLAVTLVQGGAGSEHLHRIHPGATLRAIGPHGLFTRDPGDSAPSLFVATGTGITPLR 
SMLRASLRAGLAAPHLWILFGARFEEDVIYRDHLEALARGSDRIRYEITLSRGGPSWAGRRGY 
VQAHVPELYRELAEKSGDPAPHVF r CGLDRMVS S VREIiARGELGVHRKHVHVERYD 

SEQ ID No 23 (>0RF5) 

MKSLPSDRAARiiAQSDIRTMT]^CA:<VHGINMSQGVCDTPVPSVILQAVKEAMDRGCOTYSRF 

DGIVELRHAIAAKLARHNGIAADPETDITVSAGATGAFQATCMALLNPGDEVLLFEPFYAYHA 

QAILAVEAVPRYVTARSLSWNVDGDSLERAITPKTKAIVVNSPGNPSGKVFGRMELEQIADLA 

CHHDLMVITDEIYEYFIFDGREHVSVASLPRMSERTITIGGYSKTFSITGWRIGYSVADARWA 

KAIGAMSDLLYVCAPTPLQHGVAAGIRGLPRSFYTGLAQGYERKRDRFCRALEKAGLPPCVPQ 

GTYYVLADVSRLPGRTGRERAIYLLDETGVAGVPGDAFFEGTQGSRFMRFCFAKTDEDLEEAC 
QRIEQLA 

SEQ ID No 24 {>0RF6) 

VSDPRKERLGDMDLEEFRRIGMRIIDWAADYLGHPDRYPVFPAIRPGDVKGRLAPTPPVEPEP 

MDAVLTDFEQI I LPGITHWNHPRF FAYFANTASGPGI LGELLAACLNVN\mWRTS PAA^^^ 

ELVLSWLRQMLDLDAGLHGAIMDTA3TASMVAIAAARDSAEPTIRLRGMAGQRRMRLYASEQA 

HSSIEKAAITLGIGQEGVRKIPTDPAFRKVPEALRAAWEDLGAGLRPFO/AATVGTTSTTSV 

DPIPAIVSVCREHGLWLHVDAAYAGMAAIVPEHRDVLAGCEGADSLWNPHKWLFTPMDCSVL 

YVRDADRLKRAFSLVPEYLRTEGD^OWMDWGIQLGRRFRALKLWMIVRYFGHEGLAARIREH 

LRLGQQIiAQWVDADPDWERLAPTPrSTVCFRMRPSALACIMRSADEAERESIERELDRLNEAL 

LDEVNKSGRVFLSHTRLHGRYTIRVAIGNIRSDEVAVREAWECLRAAGARLCADERFVSCSRS 
ADEGRGKS 
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SEO ID No 25 (>ORF7) 

MRREEPVLEAFYERYCAAPRETSYHVELPVDVELHQEAAPALPOARSLELAGRVALVTGSSRG 
IGKAIALRI^EQGADVAVNYHSNKDAAEQTAAEIRALGRRTMWQADVTRPNAAAELFSSVEA 
QU;PIDILV^mGDFFFK?IJ^TDDEV^lNVMDSNLSSVHYLCRAAVARM^ 

SPTYAI RGAPNVAAYS I AKTGVLI LTRSLATEEAPHGI LVNCVSPGLIDNGYLPPAQKEWMER 
RVPMGRLGRASEVADAVAFLASDRASYVSGANIAVAGGWDWTDRGTEHDRRVDLFIGHEEP 

SEQ ID No 2 6 {>0RF8) 

MSGRFPGARNVEELWQKLRAGVECVVTFTEAEAIJ^GVSREMIANPSYVRRGAPLDGVELFDA 

SFFGFSPRHAESMDPQQRIFLEVAWEALERAGYDPDAHSGPIGVFAGSAPSGYHSLAQSDPEI. 

LGALGHYOLTLNNDKDYLTTHASYKLNLRGPSVCVQTSCSTSLVAVVMACQSLi^EC^ 

GGVGIHAHQRRGYLYQENGISSPDGHCRAFDVAAKGTVGGSGIGTVVLKRLADALADGDHVHA 

VIRGAAINNDGSSKIGYTAPSVQGQAEVIGMAQALAGVEPDDISYIEAHGTGTPLGDPIEIAA 

LTRVFRAKTARRQFCAIGSLKTNLGHLDAAAGVASLIKTVI^EHRELPPSLHFERPNPKLEL 

ESSPFYVNTRLTPWHAARGPRRAGVSSFGIGGTNAHVVLEEAPAPPPSGPSRRWQLLTIAARS 

EAGLARATADMIEHLDRHSGTSIADVTYTSHVGRRAWPFRRAWGESAADLRAALASEGSPRS 

IS3CQAARERPVVFLFPGQGAQHLFMARELYEVEPIFRQSLDRCAELLRGPLGLDLRQVLYPA 

EGCRDDAEQELGRTAIAQPALFAIELSLAKLWMAWGIVPQAMIGHSVGEFAAACLAGIFREED 

ALRLVAERGRI^QQMPPGAMIJ^VPLAEPEIAPYLSDDISIJ^INGPALSWAGPIEArDAIJ^ 

ELLDHGLSCRRLHTRHAFHSKMMAPWDAFTRCVSAVERRPPSGKFLSTLTGGWISPEAATIP 

AYWARQLVEPVRFAQAVRQLLSESTWLWLELGPGQTLSPLVRQQARADGGQVWASLPRAKDA 

GADHLAV I EALGRVWS AGGTVDWKRFHEGS ARRRVLLPTYPFERQRYWAS PRHTS APPEAI I K 

PLLAKNPNVADWFFLPAWRRSDPPVSFDAQAVTTRRSTWLVFIGDEGLGAALVEGLARRGHEV 

VAWTGERFEQTGTQRYTIDPAANGDVASLFARLEIEGRMPDRIVHAFCTSPADGARIERGAA 

LEIERRLGFDSLLLLAQVIAAORHPKPLMLGVITTRAHSVIGTEI lEPLRALVLGPCRVI PQE 

IPHVSCRNIDIDLPGEGGRAEIAARLIADLERESPDSWAYRGGRRWVESIELTDVGRRSAGA 

APRLRQRGAYLITGGLGGIGLVAAELLAREAHARLILVGRTGLPARQGWDDWIoAAHGAGDATS 

RKI LR I RALEEAGAEVKI AAADVSDFNAMRS VI EEARTRFGRI DGVI HS AG I ASGGMI QLRTP 

MAAWRVMAPKVGGTLVLDAJ.LRDERPDFLLICSSLASLVGGATQIDYCAANAFLDAYAQSREG 
EEGCRVISVQWDTWSDVGMAVDFKLPADLQEGRRESLKRGISSSEGAEVLGRILSAGMSGPLA 
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ICTSDLPAYKQSVTTRRSQHEQTPAARPMHSRPTTTGAYVAPETETERRIAAIWQDLLGLEQV 

GANDDFLQLGGHSLLATQVLSRVLQTLKVGISLPQFFDAPTVAGLSRLVDAARAEGAGPVAPA 
I GRVERDAYR I KP P AAEQAARTKP 

SEQ ID No 27 (>0RF9) 

MEPVGGVDMNQPAKQQETCVFPTSFAQRRLWFLDQLEPGSAVYNMFASFRTRGPYDVDSLVRS 

VNEIVRRHESLRTTVDVIDGEPVQVIAPSLRIEVPVVDLSEIDEPEREAEARRimEESRRPF 

DLTRGPLLRAKLLRLGEADHVLILTMHHIVSDGWSMDVLFKELSTLYAAFHEGRPSPLPSLPI 

QYADFAWJQRELLQGEVLESHLGYWREHLRGAPTLLELPMDRPRPPAQTFRGSQRAFRLPLSL 

QQAVQALSRQEGATPFMTLLTAFSVLLSRYARQSDLWGTPIANRTRAELEGLIGFFVNMLAL 

RI DLGGDPS FRELLGRVREVTLGAYAHQDLPFERLVEELS PGRS PSHS PLFQVSFTLQNTPMD 

ATNRADIASGGAPLVEMKAA}<^DLILELSESPQGLLGTFEYNTDLFDAGTIERMAGHL 

SAVAAPDRPIAELPLMGAEERSRVLVEWNSTAALYPEDHCMHELFEQQVERSPEATAVLLQQQ 

TLTYREIJ^MRANQIAHHLRSLGVGPEVRVGLYLERSIETWAILGVLKAGGAYVPLDPT^ 

RLGLMMADAAPSVLLTQASLLSKLPPHGDATLVQLDALHEALSRLPHHTPRSGVTAQNLAYVM 

YTSGSTGRPKGVLVEHRGLCNLPTVQAKLYGIAPGDRLLQFAPLCFDTSFCEIALALLSGATL 

VMGTADELLPGPPLVELLKKHAVTAMLIAPTVIJ^PEQQSAALPLRVLTM^^ 

WKAPGRRLFNSYGPTETTIWASSAADLSDERIPPIGRPIANTQIYVLDEALEPVPIGVPGEIF 

IGGVGVARGYHGRPDLTAERFVPDPFGQTKGARLYRTGDRARWLPDGNLEFLGRNDEQVKVRG 

VRIELEEIRAALLKHPAVAQAVAWREDTPGDKRLVAYWGRGGARVTAAELRQSVSERLPAT 

MVPSSFVALDALPLTPNGiCVDRRALPEPEQSAGGEDHVAPRNAVEEELARIWASVLRLERVGV 

HDNFFEIGGDSILSIQIWRAQQAGLRLTPRQMFQHQTIAELSTVARAVEAVHVEQDPVTGPA 

PLTPVQRWWLEQEAAEPHHFNQSIFLEVRERLDESALEQAIAHLIDHHDALRLRLARDERGAH 

QVFAAPGGSTPFQRVDLGALPSAEQISAMEKAASEAQASUDLAAGPVVRAVLFDLGEVAPQRL 

LVIAHHIAVDSVSWRILLDDLFGAYEQARRGEAVRLPPKTTSVKRWAELLTEHAGSEAVKAEL 

GYWLDSSRRTVAPLPVDRRAGEDVWGSARHIWSLTPEQTEQLLREVPQAYRTRIDDALLTAF 

AQAI ARWTGSPAVLLDLEGHGREELAGVDLTRTVGWFTAMYPILLRVDAADPGEALKS I KEQL 

RAVPGRGLGYGLLRYLRSDTIAEVRALPQAELCFWYLGQLDQAIPEAAPFRPAREYQGSERSP 

GAHRAHLIEVNASIANGRLYATWTYSERRHEPETIERVAASFVTALRALIAHCTLPEVGGNTP 

SDFDKVRl^RQETIDALDAIDAGPGPSARGSRIEDVYPLSPLQEGILFHTLYATDYTAYVEQFH 
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WTLEGDFDAEAFTRALQDWARHAAIRTSFAWERLDAPLQIVRTGAVLPVEHQDLRGLAAEEQ 

TAHISRYVEAERQRRFDLRKAPLMRAGLLRLRKDAWCLVETIHHLILDGWSTOILLKEVFTLY 

EAHRGHRGHLALELEOPRPYGDYIGWLAKQDQVRTAAFWRRELEGFSAPTPLGVDRAVPHDDG 

GPRFGWRRIALSGDDAARLAAFARQKQLTMSTLVQGAWALLLSRYSGDPDVLFGMTVSGRSAP 

IPGIERMTGLFINTIPVRVREPADASVLAWLKALQEHEAELLEKEH3PLVEVQAHSDVPRGTP 

LFESLWFENYPVQVIFEAPPVEGPTRAEEGLRMIDAQYISDPPYPLTWAAFHGTLYLNIGY 

ERRRFDDOAVERMIGHVTTLLRGFVQRPETSVRDLPLLTAEEERTQLHAWNATAAPYPEGHCM 

HELFEQQVERSPEATAVLLQQQTLTYREI^IRANQIAHHLRSLGVGPEVRVGLCLERSIET^ 

AIUSVLKAGGVYVPLDPTYPSERLGLMMEDAAPSVLLTQTSLLSKLPPHGDATLVQLDALHEA 

LSRLPHHTPRSGVTAQNIAYWTSGSTGRPKGVLVEHRGLCNLPTVQAKLYAIAPSDRLLQF 

APLCFDTSFCEIAIJVLLSGATLVMGTADELLPGPPLVELLKKHAVTAMLLAPSVLAALPEQQS 

AALPLRVLAMAGEACPAELVKRWKAPGRRLFNSYGPTETTIWASSAADLSDERIPPIGRPIAN 

TQIYVLDEALEPVPIGVPGEIFIGGVGVARGYHGRPDLTAERFVPDPFGQTKGARLYRTGDRA 

RWLPDGNLEFLGRNDEQVKVRGIRIELEEIRAALLKHPAVAQAVAWREDAPGDKRLVAYWG 

RGGARLTAAELRQSVSERLPATMVPSSFVALDALPLTPNGKVDRRALPEPERSAGGEDHVAPR 

NAIEEELTRIWADVLGAKRVGVHDNFFDLGGHSLLLVRVHDRLGQRFDRPPSMVDLFTYPTVA 

SLARFLGERANGKQSPREAAADVTERGRRRLEARARRAKAIRGPT 

SEQ ID No 28 (>ORF10) 

MKHNIGWLLPAALATIAFVPACSPOTGEDAPS^/TSAESGAAPSADCVALGAKLQAALDGAAAA 

QKAPGAAAAVQSGDCVWRGATGVSDLVAStPTKPGDLFRIGSITKTFVSTLILMLRAEGRLSL 

DDAVSKYVKGIPAGDQMTLRQILGHTSGLFDYTYSPALGQMIEVDPTRAFAPAELIALATAEA 

PYFAPGAGFRYSNTNYIVAGLVAEAVSGGTLAGLLRTRILDPVGLAHTYLDGAEPPVQGLIRG 

YGDYGAGLVDITDQLSPTEAWAAGALVSNVDDIJ^FFALLISHELLSSDELQDMTTWTPTIW 

HEPGYGLGLIERDSALGSLNGHCGIIWGFOSASYGVPGRGDAITALINRSDGDAARLVDELAK 
WKER. 

SEQ ID No 2 9 (>0RF11) 

MSIDRAVLEQLDRVGGRIAEGKALKLLEDIAWPREVEERFFAAGEDRLPEVEYRTORDGLARR 
VAELRELLgAIDGDAPALGWLRDNVRAQIQAAELLEAAGTRAFSARSQELYGGARSRFFGGSL 
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RNIDIAEHLTERLRVHGWDEASDPEEEPU^AGALRDMIJWU^VAGl^PRLDLE I TA/DP^^^ 

VAGMSRVRIRPEATFAAWEAEGLWHHEVETHALTAHNGAAQPRCAFLRSGGPRTTRTQEGLAI 

FAELYSRSLSIGRLTRLAERVRLVDMAEQGASFLDLYRHLRERGAHRRDAYFDAQRVCRGGLV 

EGGAPFTKDACYLAGLLEVYAFLAJ^VLRGGLRDEVELLVCGRIALDDIAVLAELRAAGVLERP 

RYLPGWLRAWQTLLPYFAFTSFMDGIDLGPVERHFQELLRVAADARPAGEGRRRRGRPREG 

SEQ ID No 30 (>0RF12) 

MSESVAQLEEHRAALTGHCYRMLGSVVDADDAVQETMVRAWRSLDKFDGRSSLRTWLYRIATN 

VCIDLRADRARRARPIEEGPVGTVDDALETRPRTHWLEPVPDAHALPADIDAAERAMLRQSIR 

LAFVAALQHLPPKQRAALLLTEVLGWSAAEVADSLNTSVAAINSALQRARATLASRDLGDARP 

SLPEPQSALLDRYVNAFERYDVDALTALLHQDATLSMPPFTLWLRGHESIRAWLVGPGAGCRG 

SRLIPTAASGSPAFAQYRPAPEGGHRAWALIVLDVAGDRIVSMTSFLDTETLFPRFGLPLDLP 
A 

SEQ ID No 31 (>0RF13) 

VTI AS I DHRDQDLOTGPQAKAPARAAAPDAAPSRRAVWAGRVLSGIATLFLTFDAAVKVLKliF 

PAEASTAELGFPAHLVPTLGYLQIACLVAYLIPRTAVLGAILWTGYLGGAIAIHVRVENPLFS 
HTLFPI YVAAFLWAGLWLRDRRVRALTAS PS SQGR 

SEQ ID No 3 2 (>0RF14) 

MTTKNPRKXFVNLSVRDLKRSMEFFSKLGFEFNPQFTDEKAACMWSEEAYVMLLVESF^ 

MKKEICSTSTHTEGLFALSCSSRAEVDD^WKKAVAAGGSHAMDPQDHGFMYGWSFYDVDGHHW 
EVMWMDPKAIQP 

SEQ ID No 3 3 (>0RF15) 

MTPSERLDATFAAIJUDPTRRAIIARLASGEASVTELAKPFAMSQPAISKHLKVLERAGLISRG 
RDAQRRPCRIEAKPLEDASGWLDNYRRFWEGSYERLDDLLEELKERESKGERSKR 
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SEQ ID No 34 (>0RF16) 

VAPASAPAAGGRDAAPF1.DEAAQWLRGEQAPASRPAGEGPAGRLPGRVLVADDNADMREYALR 

LLVAEGWTVEAVADGRAALERARAHPPDLVLTDVMMPRLDGFGLLRALRADDRTRGVAVVMLS 

ARAGEEARVDSLEAGADDFLVKPFSAKELIJ^VRIHVEUyiRRREAEGQRQYUIDL™^ 

lAILRGPEHVFEWNPLYQRLVGGRSLVGEPIRAALPELEGQGIWELLDAVVRTGEPIVGKEL 

PVRLDRRGDGTTEEVFFNFVYQPMRDRDGAVEGVFVFAFDVTDQVRARRi^VEALVEALKIJU^ 

RKDEFU^IJUIELRNPMAS I SLSLTLLDDADGDGPASARYREI ARRQMGHLVRLVDDLLDVSR 

ITRGTVELRLEDVDI^WQSAAAAWPAVEARRHDVSLSVGPGDFGMRADATRLEQVVTNLL 

TNAAKYTPPGGSISVRLTREAAVGAPEAVLRVRDTGRGIPAAMLEKVFDLFTQVDQTIDRSTG 

GLGLGLTLVRRLLELHGGSVAAASAGPGQGSEFTVRLPLGPGAAPOPAPSAGPPPPREGPPPA 

QRDEPPPPPAQRAEAPEAAADRRRVLWEDAEDVRRVMRAYIEALGHEVTVAVDGLEGVKKLL 

ELRPE^,^AFVDIGLPGIDGYEVARRARAAPGGEALYLVALSGYGGPDDQARSRRAGFDLHLTKP 
WGATLQDVLTAPRT 

9. DNA sequence according to claim 7 selected from the fol- 
lowing 

(a) open reading frames, and peptide sequences corresponding to 
said open reading frames: 

pEP0cos6_0RFl sequences: 

(1) nuclecbide sequence 

Seq ID No 3 5 { >pEPOcos6_ORFi . seq) 

GGATCACCTGCGGCGCGATCGCCGACCTCGTGCTGGTGTTCGGCTCGCTGGATGAGAAGCCGG 
CGGCGCTACTGATAGAGACGGCGACGCCCGGGCTGCGGGTGGAGCGGTTGCGGGAGATGCTCG 
GCTTTCGGGCGGCCCACCTGGCGAAGCTGTCCTTCGACGGTTGCGAGGTCCCCGAGGCTCAGC 
TGATTGGCCGGCCCGGCTTTGCGCTGATGTATCTGGCCCCCTACGCCCTGGATTTCGGTCGGG 
TCAGCGTCGCCTGGGCCTGCCTGGGCATGATCCGCGCTTGCCTGGAGACCTGCGCACAGCACA 
TCCTCACCCGCCGCACCTTCGGCCACCTGCTAGCCGATCACGGCATGATCCAAACCCTGATCA 
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CCAACCTGGGGATTCACCACCAGGCGACGCTGCTCCACACGCTGCAGGCCTGCCGCGCCAGGG 
ATCGCGGCGACGTGACCGCCTCCGAGGCCACCCTCGCCGCCAAATACCTCGCGTCGCGGACGG 
CGGTCCAGGAGACGACCAACGCGGTCCAGATCATGGGCGCGCTGGGCTGCGACGAGGAGGGCG 
CGATCGCCCGCCACTTCCGCGACGCCAAGACGACCGAAATCATCGAAGGCAGCAACCAGATCA 
TCGAGGCGCTGCTGGCCAAGAACATCGCCCGCGCCGGTCGCGACAACTATCGCCGCTTCCTCG 
AT3CGGAAGTCGAGCCCGGTCGGGCCGGAGGCGCACCA 

(2) peptide sequence 

Seq ID No 36 ( >pEP0cos6_0RFl . pep) 

ITCGAIADLVLVFGSLDEKPAALLI2TATPGLRVERLREMLGFRAAHLAKLSFDGCEVPEAQL 
IGRPGFALMYIAPyALDFGRVSVAWACLGMIRACLETCAQHIl-TRRTFGHLLADHGMIQTLIT 
MLGIHHQATLLHTLQACRARDRGDVTASEATLAAKYLASRTAVQETTNAVQIMGALGCDEEGA 
lARHFRDAKTTEI lEGSNQI lEALLAKNIARAGRDNYRRFLDAEVEPGRAGGAP* 

pEPOcos6_ORF2 sequences : 

(1) nucleotide sequence 

Seq ID No 3 7 (>pEPOcos6_ORF2.seq) 

ATGACGAGCGCGGTCCCGACGCGTCAAACCAGCCTGCTCGACGACTTCGAGCGCGTCGCCGAC 
GTCGATCCAGAGCGGATCGCCGTCCACGCGAGCGAGACGAGCCTGCGCTATGGCGACATGAAT 
GCGCGCGCCAACCGCATTGCCCACGGGCTACGGGCGCGCGGGATCGGGCCCAATCAAATCGTG 
GCGGTGGCGATGGCCCGCACGCCCGAGCTGATGATCGTGCTGTACGGCATCCTCAAGGCCGGC 
GCGGCCTACATGCCCATCGCCCGCGACGCGCCGCCGCTGCGCCGCGATCATATGCTGCGCGAG 
AGCCAGGCTGCTCTGATGATCGCCGACGAAGAGATCGCGGGACTCGCGGCCCGGGTGCTGACG 
CCGGCCGACCCGTTCTTCGCGGCCATGCCGGACCACAACCCCGAGCCGCGTCACGACCCGACC 
GACCTGATTTACGTCATCTACACCTCGGGCTCGACCGGCCAGCCCAAGGGCGTGGCCATGGAG 
CACCGCGCCGTGTGGAATCGCCTGACTTGGATGCAGGCCCAGTATCCAATCGACACGCAGGAC 
GTGATCCTCCAAAAGACGCCGATCGTCTTCGACGTGTCGGTCTGGGAGCTGTTCTGGTGGCCG 
CTGGCCGGCGCCTCGGTGGCCCTGCTGCCGCAATCCATGGAGAAGTTCCCCTGGGCGATATCG 
GCGACGGTGGCGCGGTGCGGGGTGACGGTGATGCATTTCGTACCATCGATGCTGATGGCCTTC 



wo 00/22139 



PCT/US99/23535 



1C2 

CTTCAGGTGGTGGCGGGCCGGCCCGAGATGGCGGACCAGATGAAGGGCCTGCGCTACGTCTTC 
TGCAGCGGCGAGGCCCTGGCGCCGGCCCACGTGTCAGCCTTTCAGGAGCACATCAACCGAGCG 
GGCAGCATCAGCTTGACCAACCTCTATGGACCCACCGAGGCGGCGGTCGACGTCAGCTACTTC 
GACTGCCCGCCCGGCGCGTCACTCGCGCGGGTGCCGATCGGACGAGCGATCACCGGCATCCAG 
CTGCTGGTCATGCGCGACGGCGTGCCTCAGCCGCCCGGCGTCGAGGGTGAGCTCGCCATCGGC 
GGCGTTGGTTTGGCGCGCGGCTACATCTCACGGCCAGACCTGACCGCCGACCGGTTCGTGCCG 
CATCCAGGCGGCGACGGCCAGCGGCTCTACCGCACCGGCGATCTGGTGCGCAGGGACGCGGAC 
GGCGAGCTGGTCTTCCTGGGGCGCATCGACCATCAGGTGAAAATTCGCGGTCTGCGCATCGAG 
CCCGGGGAAATCGAGGCCCAGATCAC-CGCCCATCCCGATGTGGCCGACTGCGCGCTGATTATC 
GAGCAGGACTCGGAAACCCTGCCCAAGCTGACCGCCTACATTGTCGTGGCGCGACCGGGCTTG 
ACCCGGAAGGCGCTGCTACAGTTCCTGGGCGCGCGGCTGCCCGACTACATGCTCCCGAACCGC 
TTCCTGACCCTCACGGAGCTGCCCGTGACCGCCAACGGTAAGCGCGACTGGCGCGCGCTGCTC 
GGCCCGCTCGAGACCCTGCCTCTCCCTTTCTCC 

(2) peptide sequence 

Seq ID No 33 (>pEPOcosc_ORF2 . pep) 

MTSAVPTRQTSLLDDFERVADVDPHRIAVHASETSLRYGDMNARANRIAHGLRARGIGPNQIV 

AVAMARTPELMIVLYGILKAGAAYXPIARDAPPLRRDHMLRESQAAMIADEEIAGLAARVLT 

PADPFFAAMPDHNPEPRHDPTDLIVn/IYTSGSTGQPKGVAMEHRAVWNRLTWMQAQYPIDTQD 

VILQKTPIVFDVSVWELFWWPIAGA3VALLPQSMEKFPWAISAWARCGVTVMHFVPSMLMAF 

LQWAGRPEMADQMKGLRYVFCSGIALAPAHVSAFQEHINRAGSISLTNLYGPTEAAVDVSYF 

DCPPGASLARVPIGRAITGIQLLVr^RDGVPOPPGVEGELAIGGVGLARGYISRFDLTADRFVP 

HPGGDGQRLYRTGDliVRRDADGELVFLGRIDHQVKIRGLRIEPGEISAQISAHPDVADCALII 

EQDSETLPKLTAYIWARPGLTRKALLQFLGARLPDYMLPNRFLTLTELPVTANGKRDWRALL 
GPLETLPLPFS 
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pEPOcos6_ORF3 sequences; 

(1) nucleotide sequence 

Seq ID No 39 ( >pEPOcos6_ORF3 . seq) 

ATGTTACACCCGATTCCCACCGACCGTTTCGCCCTGAGCCGACCGCTCTTTCGCGGGTACCTC 
GCGCACGATCCGATCGTGCAGGGCGTGCTGGCGGGCGACCATCCAGGCTGGGTCCTGGTGGAC 
CGCGAGCCCGAGCCGCGCACGGCGCTGCTGTGGGCCTTTTCCGATCGGCTCTTCTGCGTGGGC 
GCAGCTGACACGCTGACCCCGCACGCGCTGGCCGAGCTGTTCCACGACCGACTGATCCCCCAG 
GCCCGTAAGATCGGGCAGCCGTTTTTCCAGGTTCAGGGCGAGACGGTCGACACCTGGTCGGAC 
CACCTGCATCAGGTGTCGCCGCACGCGACAGTCTCCTTCCGCCAGGCATTCCGCTTCGACCGC 
GACCTCTTCGAGCGGCTGCCAACCAAGCCGGAGCTGGCAGAGGCGCGGCTCGTGCCAATCGAC 
GCGCGGCTGCTGGCCGAACAGGCTGATCTGCGCGAGCGGATACTGGCCTCCTGGTCCAGCGAA 
GCTGCCTTCCATGCGCGCGGTTTCGGCTTCTGCTACCGCGTAGGTGACCAGCTGCCGAGCGTG 
TGCCTGGCATCGCACGTAGGCGGCGGCGCGGCCGAGCTGAGCATCAACACCGAGCTCGAAGCG 
CGCAATCGAGGTATGGCAACGCGGCTGTGCCGGCGTTTCATCGCCGAATCGCTGCAGCGCGGC 
CTGACGCCTTGCTGGGGCACCGAGACCTTTCGCCTGCCGTCAATCGCGCTGGCCCAGAAGCTC 
GGTTTCATCCCGACCTTCACCTTCCCCACCTACTGCTTCGCGACCGGCACCGAACAGCCGGAC 
GACAACTTCCTAGGCGAGCTGTACTACAGGGAATCGCGCATCGCCGGAAGTGGGACCGATGAG 
CCGCAAGCGGTTCGGCTGGCGCGGGGTTGGAGCCT3GCCGGCGACACCGAGCGTGCCGCGAGC 
TTCGCCGCACGCGCCCTGGCCGAAGGGTGGGCCGGCCACTCGACTCTGGCCACCGATCCGGAT 
TTCGCCCGATTGCGCGCCAGCGCCGCCTGGCCCCGCCTCAATGTCCCT 

(2) peptide sequence 

Seq ID No 4 0 (>pEPOcos5_ORF3 .pep) 

MLHPIPTDRFALSRPLFRGYLAHDPIVQGVLAGDHPGWVLVDREPEPRTALLWAFSDRLFCVG 
AADTLTPHALAELFHDRLI PQARK'I GQPFFQVQGETVDTWSDHLKQVS PHATVS FRQAFRFDR 
DLFERLPTKPELAEARLVPIDARLIAEQADLRERILASWSSEAAFHARGFGFCYRVGDQLPSV 
CIASHVGGGAAELSIOTELEARNRGMATRLCRRFIAESLQRGLTPCWGTETFRLPSIALAQKI. 
GFIPTFTFPTYCFATGTEQPDDNFLGELYYRESRIAGSGTDEPQAVRLARGWSLAGDTERAAS 
FAARALAEGWAGHSTLATDPDFAr.lRASAAWPRLNVP 
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pEPOcos6_ORF4 sequences : 

(1) nucleotide sequence 

Seq ID No 41 (>pEPOcos6_ORF4 .seq) 

ATGATTTGTCACTCCCACCGCTTCATTTTCCTCCACGTTCCCAAGGTCGCCGGCACAAGCGTC 
AAGGACGTCCTCGGCCAAGAGCTATTCCAGGAGGACCAGGTCACGTTCCAGATCGCTCCCAAT 
CCCCACTACCCACCTGAATGGACTGCGCCTTACGAGGAGCACATTATTGCCGCTGAATTGAAG 
AGCCAGTTGGCGCCGGAAATTTGGGACGATTACTTCAAGTTCGCCTTCGTGCGCCATCCGCTC 
GACTGGGCGGTCTCCAATTACTTCTTCTTCCTGCGCGACCGCAAAGGCCATCCGGCCCACGAA 
TTCCTGGAGCGGAAGGGCTTCGCCGGTACCATGGACATGTTTTTCGGAGCGGCCGGGCGCCAT 
CCGCTGGTCGCCGGCATGCGCTTCAGCCAATGGGAGTTCTTGTGCGACAGCGAGGGCCGGACG 
CTGGTGGACTTCGTTGGCAAGTACGAGCGGCTCGAGCAGGACTTCGCCGCCGTGTGTATCCGC 
ATCGGGCTGACCCCGCCCGACTTGCCGTGCCTCAACCAGACTCGCCACCAATCCTTTACCAGT 
TACTACGACGAGGCTTTGATGCGCCAAGTCAGCCGCGCGTTAGCTCGCGATTTCGAAATTTTT 
GATTATGCC 

(2) peptide sequence 

Seq ID No 42 ( >pEPOcos6_ORF4 . pep) 

MICHSHRFIFLKVPKVAGTSVKDVLGQELFQEDQVTFQIAPNPHYPPEWTAPYEEHIIAAELK 
SQLAPEIWDDYFKFAFVRHPLDWAVSNYFFFLRDRKGHPAHEFLERKGFAGTMDMFFGAAGRH 
PLVAGMRFSQWEFLCDSEGRTLVDFVGKYERLEQDFAAVCIRIGLTPPDLPCLNQTRHQSFTS 
YYDEALMRQVSRALARDFE IFDYA 

pEPOcos6_ORP5 sequences: 

(1) nucleotide sequence 

Seq ID No 43 { >pEPOco36_0RF5 . seq) 

ATGAAAGTGGACAAGCGGAATGTCGACGACATTCTCGGACTCACTCCGACACAGACAGGCATC 
TTGTACCACTACCTGCTGGACCCGCAGGCCGACGCCTATTTCGAACAATTGACGCTGCACCTG 
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GAGGGGCCGCTCGACGTAGCGCGCTTCCGCCGCGCCTGGGAGCGCGTGGTGGCGGCTCACGAC 

CAGCTGCGCGCCGTGTTTCGCTGGCAAGGGATCGAACACCCGGTGCAGATCATCCTCAAGCAG 

CACGTGCCGGACCTGGAGTTGGCGGAGGTCCCGCGCGACGCCGATCCGGCAGCCTTCCTGGCG 

CAATGGGTCGCGGCCGACCGGGCGCGCAAGTTCGACTTCGAGACGGTGCCCTTTCGCATCGGC 

CTCTGCCG3ACTGATACCCAACATCACGTGATGCTGCTCAGCAATCACCATATCCTGATGGAC 

GGTTGGAGTACGGGCCTGATTCTGCGGGACTTCCTCGCCTGCTACGGCGACTCCGAAAACTGG 

CGGCCACGCACCCGAACGCACTTCAAGGCGTTCATCAAGTGGCACCAGAACCGGCCACGCCGG 

GGCa^VGGAGCGATTTTGGCGCGACCTGTTGCGCGATGCGCCCGACGGCGGCTTTCCCCGCCTG 

GGCGTCGAAGAAGGCACCCGCCACTCGCTTGACTTCGGCGCCCGCAGCCGCGCTCTCGACGAC 

CGCTTGACCCAAGGCTTGCGCGACATGGCTCGCGACCTCGACGTCACCCTCGCCGCGATGCTC 

CATACCGCTTGGGGCCTTCTACTCCAGCGCTACCAGAACAGCTGCGAAGTGATATTCGGGACC 

ACCGTTTCGGGCCGCAACGTCGAGCTCGCCGGCCTCGACGAGGTGGTCGGCTTGTTCATCAAC 

ACGATTCCGTTCCGCTTCTCGGCCGCGGCCGCGACGACGCCCGTCGAGGCCTTCCGTGCGGTA 

CAGCGCAATCTGCTGGCGAGAAGCGAGTTCGAAGCCACCCCGCTGGTGGACATCAAGGGCTGG 

AGTGGTCTCGGTCCGGGCGCGGAACTGTTCGACACCATCCTGGTCATCGAGAACTATCCCTTG 

GACCGCGCTATCTTCGAGAGTGATTCCAGCCTGCGGTTGACCGACCACCAAATCTTCGAGCGC 

ACCAATTACGGGCTGACCCTGACCATCGAGACCTTCAGCCGGTTGCACGTGACGCTAGCCCAT 

CGCCGTGACCTGCTGGGCGACGCGGCCGCTGAGCGAATGCTAGATCATTTCACCGGCCTGCTC 

CAAGCCATGCTGCGCTTCCCTCACCAGCCGTTCGCGCGCCTCGAGATGAAAAGCGAACACGAG 

GCCCACCGCGTCCTGCACCAACTCAACCAAACGCGTCAGCCGCTGCCGTCCCAATCGGCTTTC 

CACCAGTTGTTCTTCGAGCAGGCCCAGGCCGATC-GGGCACGACCGGCGCTGTGGTGCGGCGCC 

ACGCGCTGGACCTACGGCCAGCTGCTGGAACGTGCCCTGCGTCTGGCGGGACGGCTGCAGGAA 

GCCGGCTTCGCCCGAGGCGATGTCGCCGCCGTCAGCCTCGGCCCGGTTCCGGATCTGATTCCC 

GGTTTGCTGGGCCCGCTGTTCGCCGGCGGCGCCTACCTGCCGCTCGATCCCACCCTGCCGGCC 

CAGCGCTCGCGGTTCATCCTCGACGATGCCGGTTGCCGCTTCCTGATCAGCGACGCGCCACTC 

GCGGGGCCCACGCCGATCCATCCGGACCCTGCCGGCGCCAGCCCCGTTGACGTCATTTTTGCC 

TGTCAGGACGGCGCCGCGCAGCCCGCCTACCTGATCTACACCTCGGGCTCCACCGGCCAGCCC 

AAAGGCGTCTGGGTTAGCCACCGCAACCTGATCAACTTCCTGACGGGCATGAGCGCAATCCTG 

CCGG7CGCGGCCGACGACGTGTTCCTCTCGCTGACTACCGTGTCGTTCGACATTTTCGGGCTC 

GAGACGTGGTTCCCGCTCAGCCGCGGCTGCACGATCGTCTTGGGCACGCGCGCCGAGCAGTTG 



wo 00/22139 



106 



PCT/US99/23535 



GACCCGGCCGCGGCTGCCAAGGCCATCTCCTGCCATGGCGTCACGGTTTACCAGGCGACGCCA 

TCGCGACTCCAACTTCAACTGGAGCACCCCACATTTGTCCGCGCCATCGGCTCCCTGACGACC 

CTGCTGGTAGGCGGCGAACCCCTCCCAGCCGAGCTGCTGCGGCGCGTACGCGAAGTGACCGAT 

GCGCGTATCTTCAACCTCTACGGTCCCACCGAAACCACCATCTGGTCCACAGCCGGGGAGGTC 

ACCGCGGCGGACGTCCCGGATATCGGCCGCCCGATCGCAAATACCGGCGTTTTCCTTCTGGCG 

CGAGACGGCTCGATCCAGCCGCCGGGCCTGGTGG3CGAGTTGTGCATCGCCGGCGAGGGCGTG 

GCGTTGGGCTACCACCGACGGCCGGACCTGAACCGAGAACGGTTTCGCGAGATTCCGCCGGGC 

CGCCTGCCCTTTGCCGGCAAGCTCTACCACACCGGCGACCTGGCCCGCTGGACCGAAGACGGA 

CGGCTCCTCTGCCTGGGCCGTCTGGACGACCAGCTCAAAGTGCGCGGCCATCGCGTCGAGCCG 

GGCGAGATCGAGGCAGTGATGGCGCGCCACCCGGCGGTCACGCAGGCGGTGGTCGTCACGCGG 

CCGCGCAACGGCGAGCCGGTCTTGGTCGGGTTCTGGACTGCGGAAGGTGAGCCGATGCCAGAG 

GAAGCGCTGAGCGCTTACCTGGCCGACCGACTGCCGAGCTACATGGTACCCGAACGGTGCATC 

CTCATGAAGGCCATGCCGCTAACCGGCAACGGCAAGATCGACCGGCGCGCCCTACCCAATCCC 

TTCGCCTTGACCGAGTCGACCCGGCAGGCGGCGCCGCGCACCTTGGCCCGCACCGCCGGCGAG 

CATCGGGTTGCCGAGCTGTGGCAGGCCTTGTTGCGACGCGAGGCGATCGGCTTGGACGAACCC 

TTTTTTCAGGCCGGCGGGAACTCATTCGGCTTGATTCGGCTTCACGCCAAGCTGGAATCCGCC 

TTCGGGAAGTCGTTCCCGATCACCGATTTGTTCCAGCATACCAGTATTCGCAGCCAGGCAGAA 

ATGCTGAGCGGCTCGTCCGTCGAGGCGCCGCTCGCGGGAGCCGTGCCGCAACCCCCGGCCGCC 

GCCGCCCAAGTTGCCTCCTCGGCAGCTAAATCCCCAGGGGAGCGCGGCGCGGCAGCGACGTCG 

AGCGGCCTGACCGCGCAACCGCCCCAACCCCACTTCCGGCCCATCGCCGTTATCGGCCTCGCC 

GGCCGATTCCCCGCCGCACCCGACCTCGACGCCTTCCTTGAACTGCTCACGGAGGGTCGCTGC 

GGCATTCGCTTCTTCAGCCAAGCCGAGCTGCGCGACGAGGGTCTCGACGCGAATCGAATCGCG 

TGTCATAACTATGTCCCGGCCAAAGGTTTCCTCGACCGGGCCGACCACTTTGATGCCGACTTC 

TTCGGCATCCCGCCGCGCGACGCAGAAATCACCGATCCGCAAATTCGGCTTCTGCTTGAGTGC 

TGCTGGAACGCGCTGGAGCATGCCGGCTACCCGCCCGGCGGCGGCGAGATCGGGCTCTTCGCC 

GGCTCCTCGGCCAACTATCACTGGCTCGAATACGTGGGCATTTCCGAGGAGAGCAGCAATCGA 

TTCGCCGTCATGATTCAAAACGAAAAGGACTACCTGGCCACGCGGATCGCCTACCAGCTCGAT 

TTGAAGGGCATTGCCGTCACCGTGCAAACGGCCTGCTCGTCGTCGCTGACCGCGGTCGAGCTG 

GCCTGCGATGCGTTACACGCCGGCCGCGTGACCATGGCTTTGGCTGGTGGCGTTGGTCTGACC 

TATCCGTTGCGCGCCGGATACCTGCACGAGGATGGAATGATCTTCTCCCCCGACGGTCGGTGC 
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CGGGCCTTCGACGCCCAGGCGGCCGGCACGGTCTGCGGCAACGGTCTGGGCATGGTGGTGCTG 

AAACAGCTCGACGCGGCGCTGGCCGACGGCGATGCCATCCACGCTGTGATTAAGGGCATCGCG 

GCCAACAACGACGGCGCC-GCCAAGATCGGCTACACGGCGCCCTCGCAGAACGGTCAGGCGCGG 

GTGATCCGCGCCGCCCATAGGCTCGCCCAAGTCGCGCCGGAGACCATCGGCTATGTAGAAGCC 

CACGGTTCGGGCACGCCGCTGGGCGATCCGATCGAGGTGGCGGGCCTGACCGAGGCCTTTGAC 

AGCCCGCGTCGCGGCTTCTGCGCCTTGGGTTCGGTCAAGTCGAATGTGGGTCATTTGGATGCG 

GCAGCGGGCATCGCGGGTTa^CATCAAGGCGGTGCTCTCGCTGTCCCATCGGACCCTGTTCGCC 

AGCCTCCACGTCGACACGCCCAACCGGCAGATCCCGTTCGCCGACGGTCCGTTCCAGGTCAAC 

ACGGAGACCCGGCCCTGGCCAGCTGCCGACCATCCCCGCCGCGCCGGCGTCAGCTCCTTCGGC 

ATCGGCGGCACCAACGTGCACGCCGTCCTGGAAGAGGCGCCGCAGTTGGCCGAGCACGCGGGG 

CGGCGGCGCGAGCGGCAGCTGTTCCTGGTCTCGGCGCGGACTGCAGCCGATCTGGAGCGACGC 

ACCGCGGCGCTGGTCCGCCACCTGGCCGCGCATCCGGACCTCGCACCAGATGACGTTGCCTTT 

ACCTTGCACGCGGGCCGCAAACCGATGACCCACCGTCGTTTCCTGGTCGCCGCCGACCTCGCG 

GAAGCCGCCGCGCGTCTGGCCGAGCCCGATCCAGTCAAATCCGCCGCGGCGCGCGCCGACCGC 

TGCCAGGTCTGGATGTTCGCCGGTCTCGGCTCTCAATACCCCGGCATGTGTGGCGGCCTCTAT 

CGCACCGAGCCGGCCTTTCGCGAGCAAGTCGACCGCTGTTTCGACCTCCTCGCGCCGCGTTGC 

GATTTGAAGCCCTCGCTCTTCCCCGAGCCCGATCAGGCCATCGACGCATCAGCCCTCGCGGCC 

ATCGACACCGCCCAGATCGCCGTCTTCGTCTGGGAATACGCGCTCGCACGGATGCTGGAAGGC 

TGGGGGCTGCGTCCGGATCGGCTGATCGGTTACAGTTTCGGCGAATACGTGGCCGCCTGCCTG 

GCCGGCGTCTTCTCCCTGCCCGACGCCTTGGCAATCGTCCGCGAGCGTGGCCGGATCCTGGCG 

GCGGCCGAGCCGGGCGCGATGGTCAGCGTGCCCCTTCCGGCCGAGCGCGTCGCGTCGCTGCTG 

GAGCCGCCGCTTGCCTTGGCCATTGACAACGGCCCCTCATGCGTGGTGTCCGGGCCGGTCGAA 

CCGGTGCGCACCTTCACCGCTCGCATGAAGCGGGACCGGGTCTGGGTGACGCCGCTCCAGGCC 

GAGCGCCCGATGCATTCGCCGCTGATGGCCGAGGCCGGCGGCTCACTGCGCGCCATGTTGGCC 

GGGTTCCGCCTGAATGCGCCGCGAATCGCGATCTTAAGCAATGTTACAGGAACCTACCTAACC 

GACGAGCAGGCCCGAGACCCCGATTACTGGGCCCGTCACCTGTGCGGCAACGTTCGCTTCGCC 

GACGGTGTGCGAACCTTGTTGGCCGAGCGCGATCCGGTGTTCCTTGAATTCGGGCCGGGCCGC 

GATCTGAGCTCCTTGGTGCGCCACCAGATGCCGGAAGGCGCCGACGAGCCGATCGCACTGATC 

CGTCATCGCGAAGATCCGGTGCGCGACGAAGACCTCGTGCTCGATGGCTTGGGCCGCTGCTTC 

CTGCGTGGGGCGACCCTCCACGGGCAGGCCTTGTACGCCGGCCGAGGCTGCCGCCGCGTGCCG 
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CTGCCCGGTTACCCGTTCCAGGGTCCACGCTGCArGCCGGCCCGCGCCGGACTGCCCGGCCTG 

GCGCGACCGACCGTGGGAGCGACCACCATCAGCTACCGACCAGCCTGGAAGCGGGCGCCGCGC 

TTGGCGGCTGTCGAATCGCTCGCGCCGCAATCCTGGTTGGTATTCAGCGACGGCAGCGAATTG 

GCGGGCGAGCTGGTGGCCGGCCTGCGCGCTTCCGGTTGCGCGACCACCCTCGTCGAAGGTGGG 

CTGGCGTTCGCGCGCTTCGCGGGCGGCTTCCGCGCGAATCCCCGCGAGGAACAAGATCTCGCA 

CAGCTGTTCGCGACCCTGTCGGCCGAAGCGATGCTGCCCACCCACATCCTGCACCTGCTCAGC 

CTGCCGTCGCCGGAGCGCGACTCGCCGCTGGCGCGCCTGGAGCACCTCACCGAGCTGGGCTTC 

CACCATCTGCTGGCCCTGGCCCGCCAACTGGAGGCGGTCGGCGCCCCCGAGGTCCGCCTCGCC 

GTGGTGACAACCGGCCTGGCGGCGATTGGCGGCGAGTCCGAGCTGCGGCCCGAGGTCGGGCTG 

TTGCGGGGACCTGTCCGCGTGATTCCCTTTGAATTCCCGAACTTGCGGCTGCGCCTGATCGAC 

CTCGACTCGGCCGATCCCATCTGGCGTAGCGGTTGTGAGCCGTTGCTGCGCGAAATGGGCGCT 

GCCCCGGGACCTGAAGAAATCGCGCTGCGCGGCACCAGCCGTTGGGAGTTGGGCTACGAGCCG 

GTCGAGGGGGGCACCGTGAGCACCATCTCCTCGCGACTGCGCGAGGGCGGCGTCTATCTGATC 

ACCGGTGGCCTCGGCGGCCTGGGTCTGGCCTTGGCCCGTCACCTCGCCCGGAAGTACCGCGCC 

ACCCTGATCCTCGCTGGCCGGCGAGGCGCGCCGGCGCGCGAGCTCTGGCACCAGGCGCCAGCG 

GAGTTCGTACCGGTCGCAGCTGCGATCGCACAGATGGAGGAGTGTGGCGCCCGCGTGATTCCC 

GTCGCGCTCGACGTCACCGACGCCGACCAAGTGAACGCGTTGTTCGCCACCATAGAAGCTACG 

GTCGGCAAGATTGAAGGCGTTTTCCACATGGCTGGCATCGTTGACGGCGGCATCATTCGAACG 

CGCACGCGGGCTGCCAGCGACGCCGTGCTGGCGCCCAAAACGGTCGGAACCTGGATTCTCGAT 

CGGGCTCTCCGCGGCGCCGGTGGCCGCTTCCTGGTGCTGTACTCCTCGATCAACGCGGTCGTC 

GCGCCCTTCGGCCAGGTTGCCTACGCCGCCGCCAACGCCTTCCTCGACGCCTTCGCCAGCGCC 

CACGAACACGACGAGCGTCTTTTCCGCGTCAGCATCGGTTGGGACACCTGGCGCGAGGCCGGC 

ATGGCCGTCGATGCCGCCCGCGCCCGCGGCGACCAGGCCCCGCTCGAAGGGCTTAGCGACGAG 

CAGGGCTTGCGCCTGCTCGAAAGCGCCTTGGTCGGTTGCGAACCGCGACTCCTCGTCTCCATC 

AGCGAACTGCGCGCTCGACTAGCCGAGCATCATCGCAACGGCGGCATTCCCCGGTTGCTCGGG 

CCCCGCGCCAACGAGGCGGGTGCAGCTGATTCCGGCGAGGAGGGCGCCACGCAAGACGCGTCG 

CCGGCCCGTCGCGCCCGTCCCGATCTGGTCGTGGCCTTCGCGCCGGCCGGCAACGAGCTGGAG 

CGCCGGATCGTGGCCATCATCGGCGCCTACCTGCGGCTCGGTCAGGTGGGCGTCGACGACAAC 

TTCAACGATTTGGGCGCCACCTCG-TCGACCTCATCCAGATCGCCCAACGCCTCGGTCGCGAG 

TT3GGCCGCGATGTCCCTGTCGTC7CGCTCTACCAACACCGCACCGTACGCGGGCTGAGCCGC 
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TTCCTCGGCGGCGCGCTCCAATCCGCGCGGTCCGGCGTCCCGACGGGCGCTGCCGCACCGGGC 
GCCGCCACGCCGGGGGTTGCCACCCCGCCGCGGCCACAACCGTCGCGCCAGCACCTGGAAAAA 
CGCCGTCAATTGAGGAAAAAAGGGGGGCCTTCCCATCATGAG 

(2) peptide sequence 

Seq ID No 44 ( >pEPOcos6_ORF5 . pep) 

MKVDKRNVDD I LGLTPTQTG I LYHYLLDPQADAYFEQLTLHLEGPLDVAR FRRAWERWAAHD 

QLRAVFRWQGIEHPVQIILKQHVPDLELAEVPRDADPAAFLAQWVAADRARKFDFETVPFRIG 

LCRTDTQHHVMLLSNHHILMDGWSTGLILRDFLACYGDSENWRPRTRTHFKAFIKWHQNRPRR 

GEERFWRDLLRDAPDGGFPRLGVEEGTRHSLDFGARSRALDDRLTQGLRDMARDLDVTLAAML 

HTAWGLLLQRYQNSCEVIFGTTVSGRNVELAGLDEWGLFINTIPFRFSAAAATTPVEAFRAV 

QRXLLARSEFEATPLVDIKGWSGLGPGAELFDTILVIENYPLDRAIFESDSSLRLTDHOIFER 

TNYGLTLTIETFSRLHVTLAHRRDLLGDAAAERMLDHFTGLLQAMLRFPHQPFARLEMKSEHE 

AHRVLHQLNQTRQPLPSQSAFHQLFFEQAQADGARPALWCGATRWTYGQLLERAUilaAGRLQE 

AGFARGDVAAVSLGPVPDLIPGLLGPLFAGGAYLPLDPTLPAQRSRFILDDAGCRFLISDAPL 

AGPTPIHPDPAGASPVDVIFACQDGAAQPAYLIYTSGSTGQPKGVWVSHRNLINFLTGMSAIL 

PVAADDVFLSLTTVSFDIFGLETWFPLSRGCTIVLGTRAEQLDPAAAAKAISCHGVTVYQATP 

SRLQLQLEHPTFVRAIGSLTTLLVGGEPLPAELLRRVREVTDARIFNLYGPTETTIWSTAGEV 

TAADVPDXGRPIANTGVFLLARDGS IQPPGLVGELCIAGEGVALGYHRRPDLNRERFREI PPG 

RLPFAGKLYHTGDI^RWTEDGRLLCLGRLDDQLKVRGHRVEPGEIEAVMARHPAVTQAVVVTR 

PRI^JGBPVLVGFWTAEGEPMPEEALSAYIADRLPSYMVPERCIIJiKAiyiPLTGNGK^ 

FALTESTRQAAPRTLARTAGEHRVAELWQALLRREAIGLDEPFFQAGGNSFGLIRLHAKI.ESA 

FGKS FP I TDLFQHTS I RSQAEMLSGSSVEAPLAGAVPQPPAAAAQVAS SAAKS PGERGAAATS 

SGLTAQPPQPHFRPIAVIGLAGRFPAAPDLDAFLELLTEGRCGIRFFSQAELRDEGLDANRIA 

CH^n^VPAKGFLDRADHFDADFFGIPPRDAEITDPQIRLLLECCWNALEHAGYPPGGGEIGLFA 

GSSANYHWLEYVGISEESSNRFAVMIQNEKDYLATRIAYQLDLKGIAVTVQTACSSSLTAVEL 

ACDALHAGRVTOALAGGVGLTYPLRAGYl^EDGMIFSPIXSRCRAFDAQAAGTVCGNGLGIWV^ 

KQLDAALADGDA I HAV I KG I AANNDGAAKI G YTAPSQNGQARVI RAAHRLAQVAPET I G YVE A 

HGS3TPLGDPIEVAGLTEAFDSPRHGFCALGSVK3^A/GHLDAAAGIAGFIKAVLSLSHRTLFA 

SLHVDTPNFQIPFADGPFQVNTETRPWPAADHPRRAGVSSFGIGGTNVHAVLEEAPQIAE^^ 
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RRRERQLFLVSARTAADLERRTAALVRHLAAHPDLAPDDVAFTLHAGRKPMTHRRFLVAADLA 
EAAARLAEPDPVKSAAARADRCQVWMFAGLGSQYPGMCGGLYRTEPAFREQVDRCFDLLAPRC 
DLKP S LFPE PDQAI DAS ALAA I DTAQ I AVFVCEYALARMLEGWGLRPDRL i G YS FGEYVAACL 
AGVFSLPDAIAIVRERGRILAAAEPGAMVSVPLPAERVASLLEPPLALAIDNGPSCWSGPVE 
PVRTFTARMKRDRVVATTPI^AER PMHSPLMAEAGGSLRAMIAGFRLNAPRI P I LSK^ 
DEQAIUDPDYWARHLCGNVRFADGVRTLLAERDPVFLEFGPGRDLSSLVRHOMPEGADEPIALI 
RHREDPVRDEDLLLDGLGRCFLRGATLHGQALYAGRGCRRVPLPGYPFQGPRCMPARAGLPGL 
ARPTVGATTISYRPAWKRAPRLAAVESLAPQSWLVFSDGSELAGELVAGLRASGCATTLVEGG 
LAFARFAGGFRAWPREEQDLAQLFATLSAEAMLPTHILHLLSLPSPERDSPLARLEHLTELGF 
HHLLAIARQLEAVGAPEVRLAWTTGLAAIGGESELRPEVGLLRGPVRVIPFEFPNLRLRLID 
LDSADPIWRSGCEPLLREMGAAPGPEEIALRGTSRWELGYEPVEGGTVSTISSRLREGGVYLI 
TGGLGGLGLALARHLARKYRATLI LAGRRGAPARELWHQAPAEFVPVAAAI AQMEECGARVI P 
VAII)VTDADQVNALFATIEATVGKIEGWHMAGIVDGGIIRTRTRAASDAVLAPKTVGTWI^ 
RALRGAGGRFLVLYSS I NAWAPFGQVAYAAANAFLDAFASAHEKDERLFRVSIGWDTWREAG 
MAVDAARARGDQAPLEGLSDEQGLRLLESAIiVGCEPRLLVS I SELRARIiAEHHRNGGI PRLliG 
PRANEAGAADSGESGATQDAS P ARRARPDL WAFA PAGN ELERR I VA 1 1 GA YLRLGQ VGVDDN 

FNDLGATSLDLIQIAQRLGRELGRDVPWSLYQHRTVRGLSRFLGGALQSARSGVPTGAAAPG 
AATPGVATPPRPQPSRQHLEKRRQLRKKGGPSHHE 

pEPOcos6_ORF6 sequences: 

(1) nucleotide sequence 

Seq ID No 45 ( >pEPOcos6_ORF6 . seq) 

ATGAGTGAAGTATCCATTCGCCCCGGCTTGGACATCGCGGTCATCGGCATGGCCTGCCGCTTT 
CCCGGTGCCCGCAACCTCGCCGAGTATTGGGCCAACCTGATCGAAGGCCTCGAAACGCTCAGC 
TTCTTCAGCGAAGAGGAGCTGCGTGAGGCCGGCTGCGATCCGGTCCAACTGGCCCAGCACAAC 
TACGTGCGCACCAAGGGCCTGCTCCCTGACGCAGACCGTTTCGACGCCGATTTTTTTGGTTAT 
TCCCCGCGCGAAGCCCAGCTGAtGGACCCCCAGATCCGCGTCTTCCACGAGGTCTGTTGGCAG 
GCGCTGGAGCACGCGGGCTACAACCCGCATCGCCACACCGGCACGATCGGCCTGTTCGCCGGC 
GCCGCGCCCAACGTTTTTTGGGAGTTTCTCTCCTATCGGTCCGATGCCGCCAATTTAGGCAAC 
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TTCACGCTGGGCCTGCACAACAACAAGGACTACCTGAGCTCGCGCATCGCCTACAACTTCAAC 

CTGACAGGGCCCAGCTACACCCTGTTCACCGCCTGCTCGACCTCGATGGTCGCCATCCACCAG 

GCCGTCCAGGCGCTGCTCAACGGCGAATGCGACCTGTGCATGGCCGGCTCGGTCTCCATTACG 

CTGCCACTGGTTGCCGGCTACACCTACACGCCGGGCATGATCGTCTCGCCCGACGGCCATTGC 

CGCACCTTCGACGCAGGCGCCAATGGCACTGTCTACGGCGACGGGGCCGGCGTGGTCGTTCTC 

AAGCGGGCCGAGGATGCGTTGGCCGACGGCGACCACATATTTGCGCTCATCAAGGGCTCGGCG 

CTCAACAACGATGGCAGTCGCAAGACCGGCTACACCGCGCCCAGCGTGCAGGGGCAGGTGGAG 

GTGATCCGCGCGGCGATGAACCTGGCGGAGGTCGAGCCGGAGGCGATCAGCTACGTGGAAACC 

CACGGGACGGGCACCACGGTGGGCGATCCGCTGGAGTTCGAGGCGCTAAAGGAGGCCTTCGGA 

GGTGGCTGCAAGGCCTTCTGTGGATTGGGTTCGGTCAAGCCGAACATCGGCCATCTGGACGTG 

ACGTCGGGGATCGCGAGCTTCATCAAGCTGGTCCTGGCGCTGGAGCACCGCATCCTACCGCCC 

ACGCTCCACTTCCAACTGCCCAACCCGAAGATGGATGTGGTCGATAGCCCCTTCTACATCGTG 

GCTGAGCGCGAACCCTGGCGCGAAGATCTGCTGCCGCGTCGGGCCGGTGTCAGCGCGTTCGGT 

CTGGGTGGCACCAACGTCCACATGATTTTGGAGGAGTTTCAGCGCGAACCGGCGGCGAACAGC 

GCGCGCACGCGCCACCTGACGGTGCTGACGGCGCGGTCGCCGCAAGCCCTGGCGCAGCTGGCG 

GCCAACCTCGCCGAACACCTGCGCGAACACCCCGAGTTGGCGCTGGCCGATGTGGCCCATACG 

CTGCTGCACGGCCGCAAGCCACATCCATTCGCGCGCATCCTGGTGGCGACCGATACGACGGCG 

GCGATCGACGCCTTGATGAACGACCGCGATCCGCGAACGCGTTTCTTCGAAGCGACCGGGCGC 

GGCGAGTCGGTGATCCTGTGTTTTGACGAAACGCCGCCGGAGCCGCGAAGCGCCCGCTACCTC 

TGGGATCACGAGCCGCTTTATCGCGCGGCGGCGACGTCGTGCTTGGCTGGTGAGGTCGCCGAC 

CCGGATCTGGAAGGCTGCTTTACTGCCCTGATCGCCGAGCAGGGCGCGGCAGCCGCCTTTTGC 

CACCAATACGCGCTGGCCGGATGGCTGCTGGCCATGGGGTTGACCCCGTCGGCGTTGATCGGC 

GTGGGCCAGGGCGAGTGGGTAGCAGCGGCGCTCGCGGAGGTGTTCCCGCCATCGGCCTGCTTG 

CGCTGGATTAGGTTCGGCGAACGGCTCCCGCAGCCGCGCGATCAACGGATTCCGTTTCTCTCC 

AATTTCTCTGGAAACTGGATCGTTC-GGCGTGAGTTGGCCGACCCGGATTACCCCAGAAAGCAG 

AAGGGTAAGCGCTGCATGAAGCGCCGTCGGTCCCAACCTCGGTCAGCTGGTGCAGGATGGGGG 

CGATGGAACCGGCTCGGTCAGCTCGTCGCGCGCTGCTCTTCCGCGGGAAGCGGAGGCGGGACG 

GTGATCGGCCCGAGGGCGAGGTTCATCTCGTCGTCGACGAGCCGGGCGCGGGTGCGCGCCCAG 

TACCTGGGGGCGAGCTCGAGG 
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(2) peptide sequence 

Seq ID No 4 6 ( >pEPOcos6_ORF6 . pep} 

mCRFPGARNLAEYWANLIEGLETLSFFSEEELREAGCDPVQLAQHNYVRTKGLLPDADRFDA 

DFFGYSPREAQVMDPQIRVFHEVCWQALEHAGYNPHRHTGTIGLFAGAAPNVFWEFLSYRSDA 

ANLGNFTLGLHNNKDYLSSRIAYNFNLTGPSYTLFTACSTSMVAIHQAVQALLNGECDLCMAG 

S VS I TLPL VAGYTYTPGM I VS PDGHCRTFDAGANGTVYGDGAGWVLKRAEDALADGDH I FAL 

IKGSAIJJNDGSRKTGYTAPSVQGQVEVIRAAMNLAEVEPEAISYVETHGTGTTVGDPLEFEAL 

KEAFGGGCKAFCGLGSVKPNIGHLDVTSGIASFIKLVIALEHRILPPTLHFQLPN'PKMDVVDS 

PrYIVAEREPWREDLLPRRAGVSAFGLGGTNVHMILEEFQREPAANSARTRHLTVLTARSPQA 

t AQLAANIAEHLREHPEIAIADVAHTLI^GRKPHPFARI LVATDTTAAI DALMNDRD 

EATGRGESVILCFDETPPEPRSAHYLWDHEPLYRAAATSCLAGEVADPDLEGCFTALIAEQGA 

AAAFCHQYALAGWLLAMGLTPS AL I GVGQGEWVAAALAEVFPPS ACLRW I RFGERLPQ PRDQR 

IPFLSNFSGNWIVGRELADPDYPRKQKGKRCMKRRRSQPRSAGAGWGRWNRLGQLVARCSSAG 
SGGGTVIGPRARFISSSTSRARVRAQYLGASSR 

pEPOcos6_ORF7 sequences: 

(1) nucleotide sequence 

Seq ID No 4 7 ( >pEPOcos6_ORF7 . seq) 

ATGGAACCGGCTCGGTCAGCTCGTCGCGCGCTGCTCTTCCGCGGGAAGCGGAGGCGGGACGGT 
GATCGGCCCGAGGGCGAGGTTCATCTCGTCGTCGACGAGCCGGGCGCGGGTGCGCGCCCAGTA 
CCTGGGGGCGAGCTCGAGGTAGCGGTCCCGCGGCCAGTAGGGCATCGCGCGAATGACGTCGGC 
CAGGTAGGCCTCCGGGTCGAGCCCGTGCAGCTTGCAGCTCGCCACGAGCGAGAAGAGGTTGGC 
CGCGGCGGAGGCGTGGTCGTCGCTGCCGAAGAAGAGCCAGGACTTTCTCGCAACCGCAATGGA 
TCGCAGCGCTCGCTCGCTGGCGTTGTTCTCCAGGCGCAGCCGACCGTCGTCGAGGAAGCGCCG 
CAACGGCTGCTCTTGGTTGAGGGCGTAGCCGAGCGCGGTGGAGACCAGGCCGCGCTCGCGGGG 
ACGAGCGTGCTCGGCCCTGGCCCAGGCAAAGAACGCGTCGACCAGAGGGCGGACGACGACATC 
GCGACGCACCTTGCGCTGCGCGG3CGGCAGGTCCGCCAGCGCGCGATCGGCGGCAAAGAGGGC 
GTTGATGCGCCGCAGCCCCTCGACACCGAGCTCGTGCTTGCAGACCGCCGCCTCCCAGAAGTT 
GGTACGGCAATGCGACCAGCATCCGACTTCGGTCGGGGGCGGACCGCGCTTCTCGTCGGCAGC 
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AGCGCCTCTTGGTGGTGTGCCGCGGAAGAGGGCGTCATAGATGGCGTGAGCGTCAGCTTGAAT 

ATACCGAGAGAAGCCGCGGAACATCTCGCAGACCGCGGCGCTGGTATGCTTGGGCTGGTACTC 

GAAGAAGACGTGATCCTTGTCCGCGAGGACGACGAAGAAGTGTCCCTTGCGGCACGGCCCGGG 

CTTCTTGTCCTTGCGCTCCTGGATGGGCCCAGGCTGGACGGAGACCCCGGTGGCGTCCGTGGA 

CAGGCAGAAGGCGGTCTCGAAGGCCTCTTTGCGCGCGGCCTCGACGATGGCGCCCAGGGTCGC 

ACCGACGTCTTCGGCGTAGCGGCACATCGTGCCGCGATCGAGCGACGCGCCCTGAAGCTCCAG 

CTGCTGCTCCAGTCGATAGAACGGGACGCCGAGCAGGTACTTGCTGGTGAGGATGTGCGCAAT 

CATCGACGGCGCGAGGAACGACCGCCGGAACAACTCCTTCGGAAGCGGCGTCGTGATGAAGAC 

CGTGCAGGTCTCGCCCTTCGGCGCCGGCGGCGGCGCGTCGAGCGAGGGCGCGTCGAGCGCTGT 

GGAGGAAGCGCTCGGCTCGCCGGCCGCTGCCGTGTCCTCCGGGCTGACGCTCGCAGCGGGCGT 

CGGCGTCGGGGCTTCTCTCGCGACGACCTGGAGCGGGGCCGCTTCCTCCTCGCCCGAACTGCT 

CGCATCCGTGACGGACCGCTCGGCCTTGTACACGACGCGTGCGAGCACGATGCGGCGCATTCC 

GCCGCGCTCGTAGCCGAGTCGCGAGGTCTCCTCGACCCCGATGCGCGTCGCCGTCGCATCGAG 

CTCGGGGCAGGAGAGCTCGATGCGGACGACGGGCAGGTCGGACTCGGACAGGTCGCGACGGCC 

CTTGCCGCCGGACCTTCGTTTCGGCCCCTTGGGGTCGTCGTGCTGCCGCTCGTCGCCTGTATT 

GCGCTCGGCGGCGTCGAGTGCCTTCGCGAGGCGCTGGACCTCGAGGAACATCGAGTCGAACGC 

CAGCTGCTCCGCGCTCACCTCGGCGCGCTCCGCCTTGGCCACGAACAGTCGACGTCGCAGAAG 

CTGCAGCTGCTCGAGCGCACGGGTGTAGGCGCGCCGAAGCTGCGCGAGCGCATCGCGCGCTCC 

CACGAGCTCGCTCTTTGCCGCGGCGAGCTCCGCTTCGAGCTGCGCGATGCGCTGCTGCTCGGC 

CGAGAGCGTCGGGTTGGCGGCGGCGTCGTGCACGACGCCGCTCTACGTAAGCCGCGCGTACTT 

GTCGAGCGAATTCGTGCGGCTCAGTGGACGCGGCGCGGTGCGCGCCTTCGCGGTTTGGACGTG 

GGCGCGATCTCGATGCCGTCGAGCAGCGTCTCGAGCGTGGCGTCGTCCACCTCGACGTGCGTG 

GCGCCCTCGGTCGGGGGGTCGGGAAGTGCGAACGCTCCGCGATCAAGGCGTTTTGAAAACAGG 

CAGATTCCACTGCCATCGAAGAAGAGAATCTTGATCGTGGTCCGCCGCTTGCCGACGAACGCG 

;iACAGCGCTCCGCAGCGAGCCTCGTACCCCACACGCTCACGGATGAGACCCGAAAGCCGCTCG 
AAGCCG 
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(2) peptide sequence 

Seq ID No 48 ( >pEPOcos6_ORF7 . pep) 

MEPARSARRALLFRGKRRRDGDRPEGEVHLWDEPGAGARPVPGGELEVAVPRPVGHRANDVG 

QVGLRVEPVQLAARHEREEVGRGGGWVAAEEHPGLSRNRNGSQRSLAGWLQAQPTWEEAP 

QRLLLVEGVAERGGDQAALAGTS VLGPGPGKERVDQRADDD I ATHLALRGRQVRQRA I GGKEG 

VDAPQPLDTELVLADRRLPEVGTAMRPASDFGRGRTALLVGSSASWWCAAEEGVIDGVSVSLN 

I PREAAEHLADRGAGMLGLVLEEDVI LVREDDEEVSLAARPGLLVLALLDGPRLDGDPGGVRG 

QAEGGLEGLFARGLDDGAQGRTDVFG VAAHRAAI ERRALKLQLLLQS I ERDAEQVLAGEDVRN 

HRRREERPPEQLLRKRRRDEDRAGLALRRRRRRVERGRVERCGGSARLAGRCRVLRADARSGR 

RRRGFSRDDLERGRFLLARTAR IRDGPLGLVHDACEHDAAHSAALVAESRGLLDPDARRRRI E 

LGAGELDADDGQVGLGQVATALAAGPSFRPLGVWLPLVACIALGGVECLREALDLEEHRVER 

QLLRAHLGALRLGHEQSTSQKLQLLERTGVGAPKLRERIARSHELALCRGELRFELRDALLLG 

RERRLGGGWHDAALRKPRVLVHRIRAAQWTRRGARLRGLDVGAISMPSSSVSSVA3STSTCV 

APSVGGSGSANAPRSRRFENRQIPLPSKKRILIWRRLPTNANSAPQRASYPTRSRMRPESRS 
KP 

pEPOcos6_ORF7 . 1 sequences : 

(1) nucleotide sequence 

Seq ID No 4 9 (>pEPOcos6_ORF7 . 1 . seq) 

ATGTTCCTCGAGGTCCAGCGCCTCGCGAAGGCACTCGACGCCGCCGAGCGCAATACAGGCGAC 
GAGCGGCAGCACGACGACCCCAAGGGGCCGAAACGAAGGTCCGGCGGCAAGGGCCGTCGCGAC 
CTGTCCGAGTCCGACCTGCCCGTCGTCCGCATCGAGCTCTCCTGCCCCGAGCTCGATGCGACG 
GCGACGCGCATCGGGGTCGAGGAGACCTCGCGACTCGGCTACGAGCGCGGCGGAATGCGCCGC 
ATCGTGCTCGCACGCGTCGTGTACAAGGCCGAGCGGTCCGTCACGGATGCGAGCAGTTCC'GGC 
GAGGAGGAAGCGGCCCCGCTCCAGGTCGTCGCGAGAGAAGCCCCGACGCCGACGCCCGCTGCG 
AGCGTCAGCCCGGAGGACACGGCAGCGGCCGGCGAGCCGAGCGCTTCCTCCACAGCGCTCGAC 
GCGCCCTCGCTCGACGCGCCGCCGCCGGCGCCGAAGGGCGAGACCTGCACGGTCTTCATCACG 
ACGCCGCTTCCGAAGGAGTTGTTCCGGCGGTCGTTCCTCGCGCCGTCGATGATTGCGCACATC 
CTCACCAGCAAGTACCTGCTCGGCGTCCCGTTCTATCGACTGGAGCAGCAGCTGGAGCTTCAG 
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GGCGCGTCGCTCGATCGCGGCACGATGTGCCGCTACGCCGAAGACGTCGGTGCGACCCTGGGC 
GCCATCGTCGAGGCCGCGCGCAAAGAGGCCTTCGAGACCGCCTTCTGCCTGTCCACGGACGCC 
ACCGGGGTCTCCGTCCAGCCTGGGCCCATCCAGGAGCGCAAGGACAAGAAGCCCGGGCCGTGC 
CGCAAGGGACACTTCTTCGTCGTCC7CGCGGACAAGGATCACGTCTTCTTCGAGTACCAGCCC 
AAGCATACCAGCGCCGCGGTCTGCGAGATGTTCCGCGGCTTCTCTCGGTATATTCAAGCTGAC 
GCTCACGCCATCTATGACGCCCTCTTCCGCGGCACACCACCAAGAGGCGCTGCTGCCGACGAG 
AAGCGCGGTCCGCCCCCGACCGAAGTCGGATGCTGGTCGCATTGCCGTACCAACTTCTGGGAG 
GCGGCGGTCTGCAAGCACGAGCTCGGTGTCGAGGGGCTGCGGCGCATCAACGCCCTCTTTGCC 
GCCGATCGCGCGCTGGCGGACCTGCCGCCCGCGCAGCGCAAGGTGCGTCGCGATGTCGTCGTC 
CGCCCTCTGGTCGACGCGTTCTTTGCCTGGGCCAGGGCCGAGCACGCTCGTCCCCGCGAGCGC 
GGCCTGGTCTCCACCGCGCTCGGCTACGCCCTCAACCAAGAGCAGCCGTTGCGGCGCTTCCTC 
GACGACGGTCGGCTGCGCCTGGAGAACAACGCCAGCGAGCGAGCGCTGCGATCCATTGCGGTT 
GCGAGAAAGTCCTGGCTCTTCTTCGGCAGCGACGACCACGCCTCCGCCGCGGCCAACCTCTTC 
TCGCTCGTGGCGAGCTGCAAGCTGCACGGGCTCGACCCGGAGGCCTACCTGGCCGACGTCATT 
CGCGCGATGCCCTACTGGCCGCC-GGACCGCTACCTCGAGCTCGCCCCCAGGTACTGGGCGCGC 
ACCCGCGCCCGGCTCGTCGACGACGAGATGAACCTCGCCCTCGGGCCGATCACCGTCCCGCCT 
CCGCTTCCCGCGGAAGAGCAGCGCGCGACGAGC 

(2) peptide sequence 

Seq ID No 50 ( >pEPOcos6_ORF7 . 1 .pep) 

MFLEVQRLAKALDAAERNTGDERQHDDPKGPKRRSGGKGRRDLSESDLPWRIELSCPELDAT 
ATRIGVEETSRLGYERGGMRRIVLARVVYKAERSVTDASSSGEEEAAPLQVVAREAPTPTPAA 
SVSPEDTAAAGEPSASSTALDAPSLDAPPPAPKGETCTVFITTPLPKELFRRSFLAPSMIAHI 
LTSKYLLGVPFYRLEQQLELQGASLDRGTMCRYAEDVGATLGAIVEAARKEAFETAFCLSTDA 
TGVSVQPGPIQERKDKKPGPCRKGHFFWLADKDHVFFEYQPKHTSAAVCEMFRGFSRYIQAD 
AHAIYDALFRGTPPRGAAADEKRGPPPTEVGCWSHCRTNFWEAAVCKHELGVEGLRRINALFA 
ADRALADLPPAQRKVRRDVWRPLVDAFFAWARAEHARPRERGLVSTALGYALNQEQPLRRFL 
DDGRLRLENNASERALRSIAVARKSWLFFGSDDHASAAANLFSLVASCKLHGLDPEAYLADVI 
RAMPYWPRDRYLELAPRYWARTRARLVDDEMNLALGPIT\^PPPLPAHEQRATS 
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pEPOcos6_ORF7,2 sequences: 

(1) nucleotide sequence 

Seq ID No 51 (>pEPOcos6_ORF7.2 .seq) 

ATGATTCCGGCGGGCGTGCAGGTGTTCGTCGCGCTGGAGCCGGTGGACATGCGCTACGGCTTC 
GAGCGGCTTTCGGGTCTCATCCGTGAGCGTGTGGGGTACGAGGCTCGCTGCGGAGCGCTGTTC 
GCGTTCGTCGGCAAGCGGCGGACCACGATCAAGATTCTCTTCTTCGATGGCAGTGGAATCTGC 
CTGTTTTCAAAACGCCTTGATCGCGGAGCGTTCGCACTTCCCGACCCCCCGACCGAGGGCGCC 
ACGCACGTCGAGGTGGACGACGCCACGCrCGAGACGCTGCTCGACGGCATCGAGATCGCGCCC 
ACGTCCAAACCGCGAAGGCGCGCACCGCGCCGCGTCCAC 

(2) peptide sequence 

Seq ID No 52 ( >pEPOcos6_ORF7 . 2 .pep) 

MIPAGVQVFVALEPVDMRYGFERLSGLIRERVGYEARCGALFAFVGKRRTTIKILFFDGSGIC 
LFSKRLDRGAFALPDPPTEGATHVEVDDATLETLLDGIEIAPTSKPRRRAPRRVH 

pEPOco3 6_ORP7 . 3 sequences : 

(1) nucleotide sequence 

Seq ID No 53 ( >pEPOcos6_ORF7 . 3 . seq) 

ATGACAAGGACGAAGGCGACCGAAGTGATGTGGTCCGAGCGCGTTCGGGCGTGGCGCGAGAGT 
GGTGAAACGGCGGAGGAGTTCGCTCGGAGCCGCGGATTTGCGGCCTCGACGCTGCACGGCTGG 
TCGAGCCGGCTGTCGCGGGCCGAGCCACCGCGCTTTCTGCGCCTGGTGCCGAAGGCGCCCGCC 
GTGACGAGCAGCGCTGCGGAGCTCGTCGTCGAGGTCGGCGGCGCGCGGGTGCGCGTCGCCGCG 
GGGTTCGACCCCGCGCTGCTGGCGGAGGTGGTCCGTGCCCTCGGCGGAGCGGGGCGA 

(2) peptide sequence 

Seq ID No 54 ( >pEPOcos5_ORF7 . 3 . pep) 

MTRTKATEVMWSERVRAWRESGETAEEFARSRGFAASTLHGWSSRLSRAEPPRFLRLVPKAPA 
ITSSAAELWEVGGARVRVAAGFDPALIiAEWRALGGAGR 
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pEPOcos6_ORF8 sequences: 

(1) nucleotide sequence 

* Seq ID No 55 ( >pEP0cosS_0RF8 . sec) 
ACTGGACAGCGCAGCCGGGGTGAGACGGCGCTTCGCGCAGCGCTTACGCAGAAGGCGCGCCGC 
GCGCCATTGTCGGATGCGGTGCGCGACTTCGCCGCCGATCGGCTGTTGCTGGAACTGGGACAA 
CCACTGGACGTAACGGCTGAAGCGAGCCAACGGCTCCAGCTCGCGCGGGGCGACCTGTTCGGC 
GCCTACCAAGCGTTGGCCCAGCTCTGGATCTGCGGCGCCCTGGCCGAACCGCCGCGACTGTAT 
CCCGACGAACACCGCCGGCGCGTGCCGCTGCCGAGCTACCCCTTCGAGGGAAAGCGGTTCTGG 
ATCGAGGGCTCGCCGTTCGAAACCGCGCCCGCCGCCGGCGCCTCACCCCAACCCGCCGATTCG 
GGGGACATTCTCAAGGGCGACCCGGCGGACTGGTACTATCGGCCGCGTTTCGAAGCGGCGCCG 
CTCTTGCCCAGCCCGTTCGAGAGCGAACCCGGCGATTGGCTGGTGTTCGAAGATGAGCTGGGG 
CTCGGCGCCTGGCTGAGCGAGACCTTGCGCGACAAGGGCGCGCGGGTCGCGACAGTCGTTCGA 
GGCACCGAGTTCCGACGCCTGGCGTCACAGCGCTTCCAGCTTCGTCCCGATCGACGGGACGAT 
TACCGGACCCTGCTGCACGAGTTGAAGGCGCAGGGCATCGCGCCGGTCCACCTGTGCCACCTA 
TGGAGCGTGACCGCCGCACCGGATGCCGAGCAGTTGCTCGACGTCAGCTTTCACAGCCTGGTC 
CATTTGGCGGCCGCTTTGGGTTCGGTTGGCTACTTCCACGCCATG 

(2) peptide sequence 

Seq ID No 56 ( >pEPOcos6_ORF8 . pep) 

TGQRSRGETALRAALTQKARRAPLSDAVRDFAADRLLLELGQPLDVTAEASQRLQLARGDLFG 

AYQALAQLWICGALAEPPRLYPDEHRRRVPLPSYPFEGKRFWIEGSPFETAPAAGASPQPADS 

GDILKGDPADWYYRPRFEAAPLLPSPFESEPGDWLVFEDELGLGAWLSETLRDKGARVATWR 

GTEFRRLASQRFQLRPDRRDDYRTLLHELKAQGIAPVKLCHLWSVTAAPDAEQLLDVSFHSLV 
HLAAALGSVGYFHAM 
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pEPOcos6_ORF9 sequences: 

(1) nucleotide sequence 

Seq ID No 57 ( >pEPOcos6_ORF9 . seq) 

ATGAAGTTGAACGTGGTCGCCAACCGGCTATTCGACCCCGAGTCGCCCGAGCGCACCGAGCCC 

GCCAAGAGTCTGTTGCTCGCGGTGACCAAAGTCCTGCCGCAAGAGGTGCCCAACGTTCGAACC 

CGCGCCATCAGCGTGGACCTGGATCGCTCGTTCGACGCGGCGGCGCCCGCCTGGGCCGCCAGT 

TTGTT3GTTGAATGCGGCGCGCCCGTCGAGGAAACGGTGGTGACCTACCATGGCGCAGCCCGA 

TGGCTGCGCCGCTTCGATCGCGTTGCGGTGAATGGTCTCGGCCCGTTCCACCCCGATCAACCT 

GCGCCGCTGCTGCGCGAGCGCGGCGTGTACCTGATCACCGGCGGCCTGGGCGGCGTGGCTGGC 

CAGTTGGCGCGCTACCTGGCGCGGGCCTGCCGGGCGCGGTTGGTGCTCACCGCGCGCCGGCCC 

CTGCCCGAGCGCGACCAGTGGGATCGGGAGTCGGCCGTGCTGTCATGGGACGACAAGACGCGC 

CAGCGCATCGAGCTGGTGCGCGAGCTGGAGCGGCTGGGGGCCGAAGTATTGGTGGTGGCTGCC 

GATGTCGCCGACGAAGCGGCCATGGCGCAGGCGATCGAGGCCTCACTGGCGCGATTCGACGCT 

TTGGACGGCTTGATCCACGGCGCCGGGATCGTGCGGGTCGCGTCGGGCCGCACGCCGATCGGG 

AGTATGACGCGGGCCATGTGCGAGGAGCAGCTCCGCCCCAAGATGTTGGGCCTCGACGTCGTC 

GACCGCCTCCTGCGCGATCGCCGGTTGGACTTCCGCATTGCCATCTCGTCGCTCGCCCCGATT 

CTCGGCGGCCTCGGCCACGTCGCCTACGCCGCCGCCAACCTCTACATGGACGCGTTCGCGACG 

CGCGCCGCCGCCGGCAACGCGCCTTGGATCGCGCTGAACCTGGCCGAGTGGGAATACGAGGGC 

CCGGCTACCTACGACGAGCGGGTGGGCCGTTCGCTCAAGCAGCTCGAGCTCACCAACGAGGAG 

GGTATCCGCGTCTTCCAGACGGTGTTGGCCTTGGCCGCGCGCGGCCCGCTACAGCAGATCATT 

ATTTCCACCGGCGACCTCCAGGCCCGCCTCGACAAATGGATTCACATCAAATCCCTGCATCGC 

CGACCGGGGCCGGTCCAGCTCAGTCGCCGGACCGCGGCACCCCAGGGCGGTTTCGGCTCGGAG 

CGCGCCGCCTTCGAGGCCGCCTTCGCTGACGCCTGGTGCGACTTCTTCGGGGTTGAAGAGGTC 

GACCCGAACAAAAACTTCTTCGATCTGGGCGCCAGCTCGCTCGACTTCATCCACCTCGTCAGT 

CGCTTCAGCAAGGCCATCGAACAGCATGTACCGCTCGAGGCCCTGCTCGAACACTCCACCCTG 

CACGACCTCGCCGCCCACCTCGCGGGCGACGCGAACACCGACGCCAGCGACGAAGCGCGCATT 

CGCCAACGGCTGCAAGGCGCCAAGTCCGGCGACATCGCCATCATCGGCATGGCCGGCCGCTTC 

CCGCTCGCGCCCGACCTGGACACCTATTGGCGCAACCTGGTCGGAGGCATCGACGCGGTCAGC 

TTCTTCAGCGCCGAGGAGTTGCGTGCTGCTGGCGTCACCGCGGCCGAGATCCACCACACCAAC 
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TACGTGCCGGCCAAGGGGCGCTGC3CCGACCAGGACTTGTTCGATGCGGCCTTCTTCGAATAC 

ACTGCCAGCGACGCCGAGCTGATGGACCCGCAAAATCGCGTGTTACACGAGGTCGTGTGGCAC 

GCGCTGGAAGACGCCTGTTTCGACTTCAACGGCGATCACGGCCAGGTCGGCCTGTTCGCGGGC 

GCCTCGCCGAACCTGTGGTGGCAGTTCGTGGCCAGCTTTTCCGAGGCCGCCAAGACGCAGGGC 

ATGTTCACCACCACCCTGCTCAACGACAAGGACTCGATCGCGAGCCAGATTTCATACAAGCTC 

GGTCTAAAGGGCCCCGCGGTCACCTTGTTCACCGGCTGTTCCACCTCGCTGGTAGCCGTTGAC 

GCCGCCTGCCGCTCGATCTGGTCCGGTCAATCGGACATGGCCGTGGCCGGCGCGGTCTCGCTG 

ACTCTCCCCGATAAGGCCGGC7ACATCTACGAAAAGGGCATGCTCTTCTCGGCCGACGGGCAT 

TGCCGGGCTTTCGACGCCAACGCCACCGGCATGGTCTTCGGCGACGGCGCCGGCGCGATCGTG 

CTCAAGCCGTTGGACGCGGCCCTGCGCGACGGCGACCCGATCCATGCGGTGATCAAGGGCTGC 

GCCACCAACAACGACGGCGACCGCAAAGCCGGCTACACGAGCGTCAGCGCCCAAC-GCCAGGCC 

GAGGTGATCCGCTCGGCCCAGATCCTGGCCGACGTGGCGCCGGAATCCATCAGCTACGTGGAA 

GCCCACGGTACCGGCACCAAGTTGGGCGACTCGATCGAGATCAAGGCGTTGAAGCAAGCCTTC 

GCCAGCGACAAGAACGGATTTTGCGGCATCGGGTCGGTCAAGACCAACCTCGGTCACCTGATG 

GCGGCGGCGGGGATGGCCGGCCTGATCAAGACGGTTCTGGCGATGAAGCACCGCCAATTGCCG 

CCATCGCTGCACTGCGACGAAGTGAACCCCGACCTGGAGTTGGAGCGCAGTCCGTTCTACATC 

AACACCCGCCTGCGCGACTGGGTTGCACCGGGCGGGCCGCTGCGGGCCGGCGTGAGTTCGTTC 

GGGATCGGCGGAACCAACGCTCACGTCATCCTGGAGGAGCCGCCGACGCGCGAGAGCGGCACG 

CGCATGCGCCACTGGAAATTATTGATGCTGTCGGCGGCCAGCGAGGCGGCGCTCGACCGCCAG 

GCCGATAACGTGGCCGACTACCTGGAGCGCCATCCCGAGGCCCACCTCAGCGACGTGGCCTAT 

TCCCTCCAGACCGGCCGGCGCGTTCTGGCCTGGCGGCGCACGGTCCTATGCGAGTACCGCGAG 

GACGCGGTGACCAGTCTGCGCGAGCGACAGGCCAAGCGCGTCCAGACAAGTCGCGTCCGCTGG 

GACCACAAGGACGTGGTCTTCATGTTTCCCGGTCAGGGCGCCCAGTACCTGAACATGGGCCGC 

GACTTATACGTCATGGAGCCGGTCTTCCGCGAGGTCATGGACCGCTGCTTCGAGTTGCTGGCC 

CCTTTGTGGTCCGAGCATCCGCGCCAGATCCTTTATCCGGAGGGCGGGGTGTCGACCCTGCTC 

CACCGGACTGATTACACCCAGCCGATCGTGTTCTGCTTCGAGTACGCCCTCGCCCATTTGCTG 

CTCTCCTGGGGATTGAAGCCGGCCGCGACCATCGGCTACAGCTTCGGCGAGTACGTTTCTGCC 

TGCCTCGCCGGCGTCTTGTCCCTGGAAGATGCGATCCGTCTGGTGACCGAGCGCGGTCGGCTG 

ATGGCGGCTTTGCCCGCGGGCGCCATGCTCAGCGTCCCGGTTCCCGAATGCGAGCTGCTGCGG 

CTGCTGGACGGCTTCCACGCCCAATCGGCGGCCCATCTGGCGCTGGCCGTCGACAATGGCGCC 
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TCCTGCATTGTGGCCGGCGAGCAGGCCGCCATCTCGGCCTTCGAATCGATGCTTCGCAAGAAG 
CGTCTGTTGACCATGCGGGTCGCGGTCAGCCACGCCGCTCATTCGCAGGTCATGACCGGCGCG 
ACCGACGCCCTGCGCAGCATCCTGCGGAAGATCCCCCTCTCCGCGCCGACAATTCCCTTCATT 
TCCTGCGTCACCGGCACCTGGATCACTGCACAGCAGGCTACGGATCGCGAGTATTGGGTGAAC 
CACATGTGCGGGACGGTGCGGTTCGCGGCGGGTCTGACCGAGCTGGGTCAAAACCGCGAGGCG 
GTGTTCCTGGAAGTAGGTCCGGGCCGCGACTTGACGTTGCTGGCCCACCGCATCCTGGCCGAC 
AGCGCGGCCGTGTTCGAGCTGGTCAAGGCGCCCGACGGCGGCGACGACGATGGGTTCCTCCTG 
CTGGATCGATTGGCCAAGCTCTGGAGGCTGGGGATTTCGATTGACTGGGCCGGCTTCTACGCG 
GATGAGCGGCGGCGGAAACTCTCGCTGCCGGGATATCCGTTCGAGCGGCGGCGCTTCTGGATC 
GAGGGCAACCCGCTGGAGATCGCCGCCGGCAGGCCCAATGTCCAGGGGCCGCTGGTCAAGGCG 
TCGGACATCGGCGCTTGGTTCTACGTGCCGCAATGGCGGCGGTCGGTGCTCGCCGAGCCGGGT 
ACAACGGCGGCGGGCGCCGCCGTCACGGCGGAGCAGGCACGCGTCGTGACCGAGCTACGGGCG 
GGATGCGCGTCGGCCGGCTTGGGCAGCGGGGCCTGCGGACTGAATGGCGGTGCCCCGTCCGAG 
CGTCCGAAAGAAAGTGTAGCGCCAGCCGGGTCGACCAGCGCAGCGGCGCAGACCGGCGCGGAC 
TGCCCGACACCGACTGGGGAGCCAGCGGCTGTGCCAAAGGACGGGGCCGAGCCGCGGCCGACC 
TGGCTTATTTTCGCCGACGCCGGCGGATTGGCCGAATCTTTCGCCAAGCGGGTTCAGGCCCGC 
GGCGAGAAGCTTTACCTGGTGGCTTCCGGCTCGCGCTTCGAGCGCCTGGCCGAGACCCGCTTC 
CGCCTCGATCCCGGGGCCAAGTCCGATCACCGCCTGCTTTTCAAGGCGCTCGACGAGGCCGAC 
ATCCTGCCGACCCACCTCCTCGACTTCCGCTCGCTTGACTGCGGCGGGCCCGACGCCGACCCC 
ATGGACCAGGCCGGCTTCTTCGGGCTGTTGCACCTGGTCCAGGCGATGGCAGAGGCCGGCTAC 
AGCCATCCCATTCGGCTGCTGATCGTCAGTTGCGGCGTCTACGATGTCACCGGTGCCGAACCG 
CTGCAGCCGGCGCGGGCCACGATGATCGGACCGGCTCTGTGCATCCCGCAACAGTATCCGCAC 
CTCGAAACGAGCCATGTGGATTTGGGCGTGGTCCATGCCGACGAGCTCCACGCCGCGCGCCAG 
CTCGACAGCCTACTTGCCGAATGCCTAAGTGCAACGGCCGAGCGCCAATTGGCGCTGCGCGGC 
CGACACCGCTGGCTGCTGGACTACGAGCCAGTCCGCTTGCCGCCGCTCGACCCGGGCCGTCTG 
CCCTGGCGCCAGCGCGGGGTCTACTTGATCACCGGCGGTTTGGGCGGGATCGGCCGCATCCTG 
GCCGAACACCTGGCCCGCACGACCTCGGCTCGCCTGGTCCTAATCGGCCGCGAAACCCTGCCC 
GACCGCGACGACTGGGACGCCTGGCTGAACCGCCCGCAACCGGTCGACGCCACCCACGAACGG 
CTGCTGCACAAGATCCGCGCGATTCGCGATCTGGAAGCGCTAGGCGCCGAAGTCCTGGTCCTC 
GCCGCCGACGTCGCCAACGAAGCCGCCATGCGCGAGGCCTACGATCGCGCCGAATCCCACTTC 
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GGCACAATCCACGGGGTGATTCAC3GCGCCGGCCTGATGGACGCGCAAAGCTTCTCACTGATC 

GACGCCCTCGACCACGACCTCTGCGCCCGCCAGTTCGAAGCAAAAATCCGCGGCGTCTGCGTG 

CTCGACCGCGTTCTGGCCGACCGCACGCTCGACTTCTGTCTGCTGATGTCTTCCATCTCCACC 

GTGCTCGGCGGCCTGGGCTATTTCGGTTACGCCGCGGCCAACGCCTTCCTCGACGCCTTCGCC 

CAGGCGCGCAGCCGCGACGCCGCTTTCCCCTGGCTTAGCGTGGCCTGGAGCGATTGGAAGTAC 

TGGACCGAGCGCAAGATGGACAACGAGGTCGGCGCCGTCATCGACAGCCTCTCGATGGAACCC 

GCCGAGGGCTTCGAAGCCGTCACCCGCGTCTTGGCTTGGGGCAAGGCGCCCCACATCGCCAAC 

TCGCCCGGTGACCTCGGTCGCCGCCGGGATCAATGGGTCAAACTGGCCAGCCTGAAATCGGCG 

CACTCCAGCGAGCCCGAGCCGGCTAGGCATGGACGTCCGGCGCTCTCCAGCGAATGGGTCGCG 

CCGCGCAACGTGGTCGAAGAGAAGCTGGTCGCCATTTTCGAGCAGGTGTTCGGCACTGCGGCA 

CTGGGCATCGAGGACAACTTCTTTGAGTTGCGCGGCGACTCGCTCAAGGCGGTCATGACCGCG 

GCCCGTATTCAAAAGGAGCTGAACGTGGAAGTGCCGCTGCCGACCTTCTTCCAGATGCCCACG 

GTCGCTGGCCTGGCCCAGTTCGTGACGCAAGCCAAGCGCAGCGGCCGGGAGACGATTCGGCGC 

ACCGCGCCGCGCCCACATTACCCGCTCTCGGCTGCCCAGGGCCGCCATTACCTGCACTACCGC 

ATGGACCCGCGTTGTACCGCATACAACGATCCCTTCGCCAACCTGATCGAGGGTCCGCTGGAC 

GTGGATCGCGTGGAGCGCATCCTGCACACCCTCATCCTACGCCACGACTGCTTCCGCACCTCG 

TTCCACTTCCGCGAGGGCGAGCCGGTCCAGGTGATTCACGATCGGGTGGACTTCAACCTGGCG 

CGGATTACCTGCGCGCCCGAGGATTTGCCCGAACGGATGCGCGATTTCATCCGCTCCTTCGAT 

CTGGAGCGACCGCCCGCCATGCGCGCCGGCCTCTTCGTCACGGGGCCCGAGCGCCACGTGCTG 

CTAATCGATTTTCACCACATTATCACCGATGGCGTGTCGTTCGAGAACTTCGTCGGCGAGTTC 

GCGGCGCTCTACCGCGGCGAGATCCTGCCCGAGCTGGAACTCGAGTACAAGGATTTCGCGGTG 

TGGCAGCATGAGAACCGGGGCCGCCGCGCCAACAGCGACCAGGCCCGCTACTGGACCGAGCAG 

TTGGCCAATGCGCCCGGGCCGATCGAGCTAACCACCGATTTCCCCCGTCCCAGTCGACGCAGC 

TTCCGCGGCGACCGCGTGCGGACCGTGCTTGATGCGGAGCTCGTTGCTCGACTCAAAGAGCAC 

GCGGCGCGCCTCGGCATCACCCTCTATAGCCTGCTGCTGGGCGGATTCTCGTTATTGCAGCAC 

AAGCTCTCCGACTCGCACGACATCGTCATCGGTTCGCCCGTCGCGGGCCGCACCCGGAGCGAA 

CTCCAGGATCTGCTGGGCGCGTTCGTCAACACCCTGCCGATGCGCCACCGCATCGACCCGACC 

CATACCGCACGGGTCTTCTTGGAGCAGGTCCACCAGACAACCTTGGCGGCCCTCAGCTACCAG 

GAGCACCCTTTTGACGAAATGGTGGCGACGCTCGGGTTCGCCGCCGATCCGGCTCGCAACCCG 

ATCriTCGACACGATGTTCTTGCTGCAGAACATGGCCATGGGTGCAACCACCATTCCCGGTCTG 
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CGGCTCTCGCCTCACGACACTTTTCACCGCAAGGCATTaTGCGACCTGATGCTACAGGCGACC 
GAGTATGACTGCCACCTGGAGCTGGTGCTCGAGTTCGCCACCGACCTGTTCCGGCTGGAAACC 
GCGCAAGTCTTGCTCGACCGCTACCGCCAAGTCTTGGAGTGGCTGTTGGCGTACCCCCATGAA 
TCGATAGACGATTTGACGCTCGCCGGCCACTTTCGCGAAGTCGAAGTGACGATGTCGGACGAG 
GGCGACTTTGATTTCTCAGATTTCGAACCCCGCAACGTGAGAAACCTATGGCGCGCC 

(2) peptide sequence 

Seq ID No 58 (>pEPOcos6_ORF9 .pep) 

MKL^^A;7^RLFDPESPERTEPAKSLLiAVTICVLPQEVP^^^/RTRAISVDLDRSFDAAAPAWAAS 
LLVECGAPVEETWTYHGAARWLRRFDRVAVNGLGPFHPDQPAPLLRERGVYLITGGLGGVAG 
QLARYLARACRARLVLTARRPLPERDQWDRESAVLSWDDKTRQRIELVRELERLGAEVLWAA 
DVADEAAMAQAI EASLARFDALDGLI HGAG I VRVA3GRTP IGSMTRAMCEEQLRPKMLGLDW 
DRLLRDRRIXiFRIAISSXiAPILGGI/SHVAYAAANLYMDAFATRAAAGNAPWIALNLAEWEYEG 
PATYDERVGRSLKQLELTNEEGIRVFQTVLALAARGPLQQI I ISTGDLQARLDKWIHIKSLHR 
RPGPVQLSRRTAAPQGGFGSERAAFEAAFADAWCDFFGVEEVDPNKMFFDLGASSLDFIHLVS 
RFSKAIEQHVPLEALLEHSTLHDLAAHLAGDANTDASDEARIRQRLQGAKSGDIAIIGMAGRF 
PLAPDLDT YWRNLVGG I DAVS F FS AEELRAAG VTAAE I HHTN YV FAKGRCADQDLFDAAFFE Y 
TASDAELMDPQNRVLHEVVWHALEDACFDFNGDHGQVGLFAGASPNLWWQFVASFSEAAKTQG 
MFTTTLLNDKDSIATQISYKLGLKGPAVTLFTGCSTSLVAVDAACRSIWSGQSDMAVAGAVSL 
TLPDKAGYIYEKGMLFSADGHCRAFDANATGMVFGDGAGAIVLKPLDAALRDGDPIHAVIKGC 
ATNNDGDRKAG YTS VSAQGQAEV I RSAQI LADVAPES I S YVEAHGTGTKLGDS I E I KALKQAF 
ASDKNGFCGIGSVKTNLGHLMAAAGMAGLIKTVIAMKHRQLPPSLHCDEVNPDLELERSPFYI 
NTRLRDWVAPGGPLRAGVS S FG I GGTNAHVILEEPPTRESGTRMRHWKLLMLS AAS EAALDRQ 
ADNLADYLERHPEAHLSDVAYSLQTGRRVLAWRRTVLCEYREDAVTSLRERQAKRVQTSRVRW 
DHKDVVFMFPGQGAQYUaMGRDLYVMEPVFREVMDRCFELLAPLWSEHPRQILYPEGGVST^ 
HRTDYTQPIVFCFEYALAHLLLSWGLKPAATIGYSFGEY^/SACLAGVFSLEDAIRLVTERGRL 
MAALPAGAMLSVPVPECELLRLLDGFHAQSAAHLALAVDNGASCIVAGEQAAISAFESMLRKK 
RLLTMRVAVSHAAHS QVMTGATDALRS ILRKIPLSAPTIPFI S C VTGTW I TAQQATDR E YWVN 
HMCGTVRFAAGLTELGQNREAVFLEVGPGRDLTLLAHRIIJUDSAAVFELVKAPDGGDDDGFLL 
LDRiJVKLWRLGISIDWAGFYADERRRKLSLPGYPFERRRFWIEGNPLEIAAGRPNVQGPLVKA 
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SDIGAWFYVPQWRRSVLAEPGTTAAGAAVTAEQARWTELRAGCASAGLGSGACGLNGGAPSE 
RPKESVAPAGSTSAAAQTGADCPTPTGEPAAVPKDGAEPRPTWLIFADAGGLAESFAKRVQAR 
GEKLYLVASGSRFERLAETRFRLDP3AKSDHRLLFKALDEADILPTHLLDFRSLDCGGPDADP 
MDQAGFFGLLHLVQAMAEAGYSHPIRLLIVSCGVYDVTGAEPLQPARATMIGPALCIPQQYPH 
LETSHVDLGVVHADEUiAARQLDSLIJ^CLSATAERQLAIJlGRHRWLLDYEPVRLPPLDPGRL 
PWRQRGVYLITGGLGGIGRILAEHLARTTSARLVLIGRETLPDRDDWDAWLNRPQPVDATHER 
LLHKIRAIRDLEALGAEVLVIAADVANEAAMREAYDRAESHFGTIHGVIHGAGLMDAQSFSLI 
DALDHDLCARQFEAKIRGVCVLDRVLADRTLDFCLLMSSISTVLGGLGYFGYAAANAFLDAFA 
QARSRDAAFPWLSVAWSDWKYWTERKMDNEVGAVIDSLSMEPAEGFEAVTRVLAWGKAPHIAN 
SPGDLGRRRDQWVKLASLKSAHSSHPEPARHGRPALSSEWVAPRNWEEKLVAIFEQVFGTAA 
LGIEDNFFELRGDSLKAVMTAARIQKEl^EVPLPTFFQMPTVAGLAQFVTQAKRSGRETIRR 
TAFR PHYPLS AAQGRH YLH Y RMD PRCTAYNDP F ANL I EG PLDVDRVER I LHTL I LRHDCFRTS 
FHFREGEPVQVIHDRVDFNLARITCAPEDLPERMRDFIRSFDLERPPAMRAGLFVTGPERHVL 
LIDFHHIITDGVSFENFVGEFAALYRGEILPELELEYKDFAVWQHENRGRRANSDQARYWTEQ 
LANAPGPIELTTDFPRPSRRSFRGDRVRTVLDAELVARLKEHAARLGITLYSLLLGGFSLLQH 
KLSDSHDIVIGSPVAGRTRSELQDLLGAFVOTLPMRHRIDPTHTARVFLEQVHQTTLAALSYQ 
EHPFDEMVATLGFAADPARNPIFDTMFLLQNMAMGATTIPGLRLSPHDTFHRKALCDLMLQAT 
EYDCHLELVLEFATDLFRLETAQVLLDRYRQVLEWLLAYPHESIDDLTLAGHFREVEVTMSDE 
GDFDFSDFEPRNVRNLWRA 

pEPOcos6_ORF10 sequences : 

( 1 ) nucleotide sequence 

Seq ID No 59 ( >pEPOcos6_0RF10 . seq) 

ATGGCGCGCCTGAGCCGCACAGATCTCCAACTCGCCATTCACCAGCGCACCGTGGAGCGCGAA 
TATTGGCGCGCTCTGTTCGAGCGCCATCCGCAACGGTCCAGTTTGCCGGGGGTGCTCACCGCC 
CCGATCGGCGACGAGTCGACCCGCGAGACCTTGTCATTCGTCCTCGACGAAGATCCCCTTCGG 
CTGAGTAATCGTTCGCCGCAACGCC7GCTCACGGTGTTGGCGGCTGGCCTCGCGGCTTTCCTC 
CACCGCTGCGACGGCGCTGAGCGC7TCACCCTGGGGTTGGCCCTACCGCGCCAAGCCGATGAC 
CATCACCCGATCCTCAACAGCTTGATCGCGCTGGGGGTCGCGGTCGACTCGAGTACGACCTTC 
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CGCGATCTGCTCTATGCGCTTCGA7CCGAATACCACGAGGCGATGCGCCACGCCAACTTTCCG 

CTGGCGACCTGGTGGCGCGGCCTACCCGGCGGAACGGCGCCGTTCGACGTCGCCCTCAGCCTG 

GACCCCTTCACAGACGGCGATTCGCTGGAAGACCACGCGATCGGCGCGTTGTTCCGGTTCGCA 

TTGGAGGGTGAGCGCCTCACCTGCCGATTGCGATTCGACCCTGCGCGCTATGACCGTCCCGCG 

ATCGAAAACCTCGCCGATCGTTTCGCCCGCTTCCTCACGCGCCTGTGCCGGGACGCCTCCACC 

GTCATCCAGGCGCTGGACCTTTCGCTGCCAAGCGATGAATCGGTGTGGCGCGTCACTGAAGGC 

GTGCGGCGCGGCTATTCGCAAGACCTGACGCTAGACCGCGCGTTCCGCCGCCAGGCCGCGCAA 

ACGCCCGATCAGCCGGCGATCACGTTGAACGGGGACGTCCAGAGCTACGCCGAGGTCGACCGC 

CGCAGCGACGCGCTGGCCCGCCACCTCCGTCGCCACGGCGTCGGTCCGGAAACGATTGTGGCC 

GTCAACGCCCGGCGCGGGCCTAATCAGCTGACGGCCCTGCTCGCC-GTCCATAAGGCCGGCGGC 

GCCTACCTGCCGATCGATGCCGAGGAGCCGGCTGCCCGCCAGCAATTCAAGGTGCGCGACAGC 

GGGGCGCGGTTGGCACTGGAGCCGTCGCCGGACCAGGCGCTGACCGTCACCGACCTGCCGCGG 

CTCTTCCTGGACGATGCCTCGCTCTTCGCTGACGGCGGGCTCGATGTGCCGCGCGGCGCCGAC 

TCGCTCAATCCGGCCTATGTGATGTACACGTCCGGCTCGACCGGACAGCCCAAGGGTGTGGTG 

GTTCCCCACCGCGGCGTGGTCAATCGTTTGAATTGGGGGCAGTCCCGTTTCCCGCTGGACGAA 

CGCGACCGAATCCTCCAAAAGACGCCGCTGCTGTTCGACGTGTCGGTCTACGAGCTGTTCTGG 

GGCGCATGGAGCGGGGCCACCCTGGACATCCTCGAGCCCGGCGCCGAGCGCGACCCCGACGCA 

GTGGCCAGGGCCCTGGCCGAGCGCGCCATTACCGTATGCCATTTCGTGCCTTCGATGCTGCTC 

GTCTACTTGGAAGTCATGCGGCGGCACCATGCGCCGCCCGTGCCGGACCGGCTCCGTTACGTC 

TTCGTCAGTGGCGAGGCCCTCGAACCGGACCACCTCGCCGGGCTCCAGCAGATTGGTCGGCGC 

CTCGGCCGCACGATTCCCCTCGTTAATCTGTATGGACCAACCGAGGCCTCGATCGAAGTCTCC 

TGCTTCGCCTGTCCCGCCGACCATGTGCCGCGCCGGATCCCCATCGGGCAGCCGATCGACAAC 

GTCGCACT3CACGTTCTCGACCGGCGCGGCCGTCGCCAGCCGCCCTATCTTCCTGGCGAGCTG 

TTCCTGGCCGGCGACTGCCTGGCGCGCGGCTACCTCAACCGTCCCGACCTGACCGCGCTCCAC 

TTCGTGCCCAATCCCTTCGGCAACGGCGAGCGCATGTACCACAGCGGCGACTTGGCGCTCGTG 

CGCGGCGACGGCCAAGTGGCGTTTCTCGGCCGCCGTGACCACCAAATCAAAATCCGTGGTCAA 

CGGGTCGAACTGGGCGAAATCGAGAGTCATTTGCGCGGGCTCGAAGGCATCGCCGCCGCCGTC 

GTCCAGGCCGAGTCGCAGCACCATGAAACCCTGCTGCACGCCTACGTCGTCACCAACGACGCG 

GGCCTCAATGCGGCCCGGCTGCGCGCCGCCCTCGCTCAACATCTGCCCGAGTACATGATTCCC 

CAGCGCTTCTCGCGGCTGGCCGAGTTGCCGCTGCTGGCGGCAGGCAAGATCGACCGCGCCGCC 
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CTCGCGCAACGTGCAACGCCGCTCGCCAGCGGCGCGCCCTTCGT3GAACCCAGCGGGCCCACC 
CAGCAGCGTATCGCAGAACTGTGGCGCCAGGTCTTAGCGGTCGCCGAAGTCGGCGCCGAGGAT 
CCCTTCTTCAGCATCGGCGGCAACTCGCTCAATGTGCTCAAGCTCAGCGCCGCGCTGAGCGAC 
GCCTTCGCGCGTGACATTCCCATGCCGGCCCTGTTCCAATACGACACCATCGCCGCCCAGGCC 
TCCTGGCTCGACGGGCAGGTTGACGAACGGGCCCAATCCGCCGCGCTCGACCGGCAGGCCGCC 
GAGGCGGCGCTGACCCTTCAAGAGACCGTGGCCATTTTTGAGGGATTCGATGACGAACCA 

(2) peptide sequence 

Seq ID No 60 ( >pEPOcos6_ORF10 .pep) 

MARLSRTDLQLAIHQRTVEREYWRALFERHPQRSSLPGVLTAPIGDESTRETLSFVLDEDPLR 
LSNRSPQRLLTVLAAGLAAFLHRCDGAERFTLGLALPRQADDHHPILNSLIALGVAVDSSTTF 
RDLLYALRSEYHEAMRHANFPLATWWRGLPGGTAPFDVAI.SLDPFTDGDSLEDHAIGALFRFA 
LEGERLTCRLRFDPARYDRPAIENLADRFARFLTRLCRDASTVIQALDLSLPSDESVWRVTEG 
VRRGYSQDLTLDRAFRRQAAQTPDQPAITLNGDVQSYAEVDRRSDALARHLRRHGVGPETIVA 
VNARRGPNQLTALIJ^VHKAGGAYLPIDAEEPAARQQFKVRDSGARLALEPSPDQALTVTDLPR 
LFLDDASLFADGGIJDVPRGADSI^PAYVMYTSGSTGQPKGVVVPHRGVVNRLNWGQSRFPLDE 
RDRILQKTPLLFDVSVYELFWGAWSGATLDILEPGAERDPDAVARALAERAITVCHFVPSMLL 
VYLEVMRRHHAPPVPDRLRYVFVSGEALEPDHLAGLQQIGRRLGRTI PLVNLYGPTEAS lEVS 
CFACPADHVPRRIPIGQPIDNVALHVLDRRGRRQPPYLPGELFLAGDCLARGYLNRPDLTALH 
FVPNPFGNGERMYHSGDLALVRGDGQVAFLGRRDHQIKIRGQRVELGEIESHLRGLEGIAAAV 
VQAESQHHETLLHAYWTNDAGLNAARLRAALAQHLPEYMIPQRFSRLAEIiPLLAAGKIDRAA 
LAQRATPLASGAPFVEPSGPTQQRIAELWRQVLAVAEVGAEDPFFSIGGNSLNVLKLSAALSD 
AFARDI PMPALFQYDTI AAQASWLDGQVDERAQSAALDRQAAEAALTLQETVAI FEGFDDEP 
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pEP0co36_0RFll sequences: 

(1) nucleotide sequence 

Seq ID No 61 ( >pEP0cos6_0RFll . seq) 

ATGACGAACCATGACCATCACGAGGAGAGCAGCGGCCTGGAGATCGCCGTCATCAGCATGGCC 
TGCCGATTCCCGGGTGCTATGGCCTGCCGATTCCCGGGTGCTGCCGATTGCGACGCATTCTGG 
GAAAACCTGATCAACGGGACCTCCTCGATCACCCATTTCAGCGACGACGAGCTGATCGCGGCC 
GGCGTTGACGCGCGCGACCTGACGCCGCAGTACGTGCGCGCGGCCGGCCAGATCGATGACGCC 
GAACGGTTCGACGCGGCCTTCTTTGGGTACTCCCAGCGTGAGGCCGAGCTGATGGACCCCCAG 
TTCCGCCTGCTCCATGAATGCGCCTGGTCCTGTCTGGAACAGGCCGGCATCGATCCGCGCGTC 
GAAGCCGCGCCGATCGGGCTGTAT3CCGGCGCAGCCGACAACACCTACTGGAACGCGCTCTCG 
TCGCTCGACCGGGGCTCGGCCGAATCGGAGCAATTCGCCGCCGAACAACTTTGCAACCGCGAT 
TTTCTGTGCACGCTGGTCGCCGCCGCGCTCAACCTGAAAGGCCCCGCGGTGGTGGTTCAAAGC 
GCCTGTTCGACCTCGCTGTTGGCGGTCCACTCGGCCTGTCGTGCGCTCCTGACCGGCGAATGC 
CGAGTGGCCTTGGCCGGTGGGGTGGCGCTGCGCTTCCCACGCCCGAGCGGTTATCGCTACGAA 
CCTGGCATGATCTTCTCGCCCGACGGGGTGTGCCGGCCGTTCGACGCGGGCGCTAACGGGACG 
GTGCCCGGCGAAGGCGCGGGGCTGGTAGCGTTGAAGACGCTGAAACGTGCCCTCCAGGACGGC 
GACACGATCCACGCCGTGATTCGCGCGACCGCGGCAAACAACGATGGTGCCCGCAAGACCGGG 
TTCACCGCGCCCAGCGCCCACGGCCAAGCCGAAGTCATTCGCACGGCGCTGCGCCTGGCCCGG 
GTGCCGGCCGAATCGATCGACTACGTCGAGGCCCACGGAACCGGCACGCCGCTAGGCGACCCG 
ATCGAGGTAGCCGGCTTGGTGGAGGCCTTCGCCAGCGAGAAGCGCGGCTATTGCCGGCTGGGC 
TCGGTCAAATCCAACCTTGGTCATCTGGACACTGCTGCCGGCATCGCCGGCCTGATCAAGACC 
GTGCTGGCGCTCGAGCACGCGCACATCCCCAAGTCCTGCCACGTCGCCACGCCCAACCCCGCG 
GCGCGCCTACACAAGACGCCTTTCCGCATTGCCGCCGACGGGATGGCCTGGCCGCGGCGTATG 
GCGACGCCGCGGCGGGCGGCGGTGAGTTCGTTCGGCATCGGCGGCACCAACGTCCACGCGATT 
TTGGAGGAGGCGCCGCCCCGCGCGCCCGAGCTGGCGGACGGGCGCAGTCAGGTGTTCGTCTTC 
TCCGCCAAGGACGAGGCGGCGCTGGACCGTGCCCTTGCCAACTATGGTGCGGCCTTGGAGAAG 
CGCGGCGACCTCGCGGCGGGCGCGGTC-GCCTGGACGCTCCAAAACGGCCGGGCCGCATTCGAA 
TGGCGAGCCAGCGCGGTGGCATCCGACCTCGACGAATTGGCGGGCGCATTGCGCGGCGAGCGG 
CCCGGCGCCGTCAAGAAAAACCGAATGGCGCGCGAGGATAAGCCGGTGGCGTTCTTATGTTCG 
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GGGCAGGGGAGCCAGTACCGTGGCATGGGCCACGACCTGTACCGCGAAGAGCCGCGTTTCGGG 

CACCACCTCGACGCCTGCCTCGCCATCCTCGCCGAACACAAGCCCGAGATCGACTGGCTGGCG 

TTGCTGGGCTACCGCGACGAGGACGAGCCAACCGACCAGATCGGGACGTCCTCGCAGGGCCCG 

AGCCGGTCAGCCGCATCGAACCCAGCGGAGCTCCTCGACAGCACCGA.ATTCGCCCAAGCTTTG 

CTTTTCTCCATGTCCTACGCGCTCGGTCGGCTGTGGCTCGACTGGGGCGTGCGACCCACGGCG 

ATGATCGGGCACAGCCTGGGCGAGTACAGTGCTGCATGTATTGCAGATTTCTATGCACTCGAT 

CAGGTGCTGCCCTTCATTCTGACCCGCGGTCGAGTCATGGCGCAATTGCGGCGCGGCTCGATG 

TTGGCCGTCAGCGGTGACAGCGTTCTGATGCGCGAGCTGATCGCCGATGCGCTCGATTTGGCG 

GCGATCAACGGCGCTGACCAATTTGTCTGGAGCGGGCCGAGCGAGGCTGTCCAAGCCGCGGGG 

GTCCGACTGCGCGGCGCCGGCCTGCGTGCCACCGAGCTGAACACCTCACACGCGTTCCATTCA 

GCCATGATGGATCCCATTCTGGAGGAGCTAACGGTTGCCGGTTCGCGACTTCAGGTCGGTGTC 

GGGACGATTCCGGTCGTTTCATGCGTTACCGGAACCTGGTTGACGGCGAAGCAGCTGGCCGAT 

CCGCGCTACCACGCGCGTCACGCGCGCGAACCGGTGCGGTTCGCGGCGGGCCTAGCGACGCTG 

ACAGGGGAGGAGCCGCCGCTGATGCTCGAAGTGGGGCCGGGCTCGACCCTGGCGGCTTTGGCC 

CGCGAGCATTCGAATGCCCGCCTCCCGGTCGTCACCAGCCTGCGCCACGCTCGCCAGGCGACG 

CCCGATCGCCAATACCTGCTCGAAACGCTCGGCTGCCTTTGGCGACACGGGGTTTCCGTCGAT 

TGGGGGGCCCATGCCGGACGTTCGCGACGCTTGGTTTCGCTGCCCGGCTATCCCTTTTCCGGC 

GCC-GTGCGCCGCTTAGCCGGCGACCCCCTCCGCCTGCTGGCCGGAGCCCGCGCCGTCGCCGCC 

CCGTCGGGAACGCGCCAACTCAGCGCCGACGCGCGCGACCTCCCGAACACTCCGGAGCCGACA 

TCCGGCGCCGTGTCGGCGATCAAAGCGCCAATCGCCGCCGCCGATCCCGGCCTCTATCGCCTC 

TCCTGGCGCCAGGCCGGAACGGCGCCGCTCGGTCCGCCCGATCTCGGTCCGCCCCGCGACTGG 

ATCGTCTTCGCCTCTGATTCTCACCTGCTCCAGGCGCTCAGGGCCAATCTCGGGACGCGCGCT 

CAGCGGGTGACGCTGGTGACGCCGGGCCAGGAGTACGCAGCCGAGCCGTCCGGGTTTCGGCTG 

CGGCCGGACCAGATCGACGATTACCGCGCCCTGTGGGCGGACTTGGCGCAAACCGGTATTGTG 

CCACGATACATCGCGTTCCTCGCCCCGTTCATGTACCGGGCGCGCATGGCGGGCGATGCCTCG 

ACCCTGGACGAAGTGCGCGAGGGCGGCTTCCTGCCCCTGACCCGCTTGATCCAGACTCGCCCG 

CCAGGCGGACCGAGCGGACTTCTAAGCCTCACGATCGTCACCCCGGCCGCCCTGGCGCTGGGC 

GACGAAGCGACGCGCCCGGAATGGGCAATCCTGCACGGGATGGTCGCCGGCTTAAGCCGCGAT 

TATCCCGAATGGCGCTTCGTCTCGATCGACGGCGGCGACCCATCCCCGCATCGGTGCGAAGGT 

CTGGCCCGCTTGATCGCGCTTCATGCGGTCGACGAGGCTGGCCCGACCCGCTTGGCGCTGCGC 
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GGCCTTCACGCTTGGGTTCCACAGTGCGAGCACGTTCAGCCGGCCACCATCCCTGGGGCGGGT 
ATGTGGCGCGAGGGTGGTGTGTACATGATAACGGGCGGATTCGGCGGGATCGGTCTGGCGCTG 
GCCCGCGCCCTGGCTCGAGAAGCTCGCGCCAAGCTGATCCTGGTCGGCCGAAACCTGCCCACC 
GCGCCGATCGATCTCGAGGCTTGGGACGCGCCGCCGTTGATTCTCACCGCCGACGTCGCCGAC 
GAAGAGGCCATGCGCCGCGTCTTCGATGCCGCGCACGCCCGGTTCGGCGCCATCGACGGCATT 
CTTCACGCGGCCGGTGTCCCCGGTGGCAGCCTGTTCGCCAACCAATCGGACGCGGCCTTCGAA 
GACGTGCTGCACGCCAAGGTTCGCGGTACCCTCGTGCTGCAAGGCCTGAGGGCAATCGATGCG 
CCGCTGTTGCTGATGTCCTCGCTGGACGCCTGGCTTCCCGGTCCCGGTCAGACCGCCTATGCC 
GCCGCCAACGCCTTCCTCGACGCCTTCGCCAGTCTGCGCCGGCGAGAGGGAGAGCCGGTGTAC 
AGCGTTGGCTGGGACAGTTGGTGCGAGGTGGGCATGGCTGCTCGGGTCGCTGCCCGATCGGCC 
GACGAACGCGGCCGCCTGGGGCGCGAGGGGATCAGCCCTCGCCAGGGTTGGCAGGCTTTGAGC 
CGGGCGCTCGCCCTCGACCCCCCCCACCTGATGATCTCGCGCACCGACCTGACCTCGCGCTGG 
CACAGTCGATCCAGCCCTACGCCGGTCGCCTCGAGCGAACCCGAGGTGGCGCTGCCGCGCTGG 
ACCGCATCCGCCTGCCAAGCCGTCATCGAGCGTGTTTGGTGCGAGCACTTCGCCACCGCCGCC 
GTGCCTCCCGATGGCAACTTTTTCGAGCTCGGCGCCAGTTCCTTCGACATCGTCCAGCTCAGC 
GCTCGACTTCAACAACAGTTCGGCCGAGATGTCAGCCACACCGTGCTCTACAGTCATCCCACC 
GTCGCCTTGCTGGCCGGCTACTTCGCCAATGACCCGACGCCGTCCGGTGCTGCTGCCGACGAA 
CGCGACGAAGCGGTGCGTCGCGGCCGCGACCTCTTGAAGAGCCGCCGGCGAGGAGTA 

(2) peptide sequence 

Seq ID No 62 ( >pEPOcos6__ORFll . pep) 

MTKHDHHEESSGLEIAVISMACRFPGAADCDAFWHNLINGTSSITHFSDDELIAAGVDARDLT 
PQVVRAAGQIDDAERFDAAFFGYSQREAHLMDPOFRLLHECAWSCLEQAGIDPRVEAAPIGLY 
AGAADNTYWNALSSLDRGSAESEOFAAEQLCNRDFLCTLVAAAIiNLKGPAVWQSACSTSLLA 
VHSACRALLTGECRVALAGGVALRFPRPSGYRYEFGMIFSPDGVCRPFDAGANGWPGEGAGL 
VALKTLKRALQDGDTIHAVIRATAANNDGARKTGFTAPSAHGQAEVIRTALRLARVPAESIDY 
VEAHGTGTPU3DPIEVAGLVEAFASEKRGYCRLGSVK5^^^GHLDTAAGIAGLIKTVLALEHAH 
IPKSCHVATPNPAARLHKTPFRIAADGMAWPRRMATPRRAAVSSFGIGGTNVHAILEEAPPRA 
PELADGRSQVFVFSAKDEAAIiDRALANYGAALEKRGDLAAGAVAWTLQNGRAAFEWRASAVAS 
DLDSLAGALRGERPGAVKKNRMAREDKPVAFLCSGQGSQYRGMGHDLYREEPRFRHHLDACLA 
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ILAEHKPEIDWLALLGYRDEDEPTDQIGTSSQGPSRSAASNPAELLDSTEFAQPLLFSMSYAL 

GRLWLDWGWPTAMIGKSLGEYSAACIADFYALDQVLPFILTRGRVMAQLRRGSNTLAVSGDSV 

LMRELIADALDUVAINGADQFWSGPSEAVOAAGVRLRGAGLRATELOTSHAFHSAMMD^ 

ELTVAGSRLQVGVGTIPWSCVTGTWLTAKQLADPRYHARHAREPVRFAAGLATLTGEEPPLM 

LEVGPGSTIJUUAREHSNARLPVVTSLRHARQATPDRQYLLETLGCLWRHGVSVDWGAHAGRS 

RRLVSLPGYPFSGAVRRLAGDPLRLLAGARAVAAPSGTRQLSADARDLPNTPEPTSGAVSAIK 

APIAAADPGLYRLSWRQAGTAPLGPPDLGPPRDWIVFASDSHLLQALRANLGTRAQRVTLVTP 

GQEYAAEPSGFRLRPDQIDDYRALWADLAQTGIVPRYIAFLAPFMYRARMAGDASTLDEVREG 

GFLPLTRLIOTRPPGGPSGLLSLTIVTPAALALGDEATRPEWAILHGMVAGLSRDYPEWRFVS 

IDGGDPSPKRCEGLARLIALHAVDEAGPTRLALRGLHAWVPQCEHVQPATIPGAGMWREGGVY 

M T TGGFGG I GLALARALAREARAKL I LVGRNL ?TA P I DLEAWDAF PL I LT AD VADE EAMRRVF 

DAAHARFGAIDGILHAAGVPGGSLFANQSDAAFEDVLHAKVRGTLVLQGLRAIDAPLLLMSSL 
DAWLPGPGQTAYAAANAFLDAFASLRRREGEPVYSVGWDSWCEVGMAARVAARSADERGRLAR 
EGISPRQGWQALSRALALDPPHLMISRTDLTSRWHSRSSPTPVASSEPEVALPRWTASACQAV 
I ERVWCEHFATAAVP PDGNFFELGAS S FD I VQL S ARLQQQFGRDVS HTVL Y SH PTVALLAG Y F 
ANDPTPSGAAADERDEAVRRGRDLLKSRRRGV 

pEPOcosS 0RF12 sequences: 

(1) nucleotide sequence 

Seq ID No 63 ( >pEPOcos6_ORF12 . seq) 

ATGACCGTGGAGCACGAAACCGGATTCGAAATCGCCGTCATCGGGCTGGCTTGCCGCGTTCCC 
GGCGCTGCCGACGTGGCCGCCTTCTGGCGCAACCTGGTCGAGGCCAAGGAGAGCGTGCGCTTC 
TTCGAGGACCACGAGCTGCGGGCCGCCGGCGTGCCCGAGGAGATCTTGCGCCTGCCCAACTAC 
GTGAAGGCCAAGCCACTGCTCGCTGATGGCGAAGCTTTCGACGCGGACTTCTTCGGGTTCCAT 
CCGCGCGAGGCCGCCTACCTGGACCCGCAAGTTCGGCTCCTGCACGAATGTTGTTGGACCGCC 
CTGGAGGATGCCGGCTACGATCCCGCGCAGTACGCCTACCCGATCGGGTTGTTCGCGGGCGTC 
TCCAGCAATCTCTCGTTCCTGTTCGACCGCATCGATCCGCGCGACTCCCCCCTGCAGAAGCGC 
TATGTGGCCGAGCTGAACGCGGCCTCCTTCGCCACCCAGATCGCCTACCGGCTCGATCTGAAG 
GGGCCGGCCATTTCGATTCAAACCGCCTGTTCGACGTCACTGGTGGCGATTCACCTGGCGGCG 
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caaagcctgatcggcggcgagtgc:acatggccttggccggcggagcgaccttggaggtcccc 

AAAAAGCCCGGCTATCTCTACCGCGAAGGCTACATCAACTCGCCGGACGGCCACTGCCGGGCC 

TTCGACGCCGACGCGGGCGGCACCATCTTCGGCGACGGCGTCGGCATCGTCCTGCTCAAACGC 

TACCGCGACGCCCTACGCGACGGCGATCACGTGTACGCAGTGATCAAAGGCTCGGCGATCAAC 

AGTGACGGCCATCGCAAGGTGTCCTACACGGCGCCGGGCAAGAGCGGTCAAGTGGCGGTGATC 

CGCGCTGCGCTGGCGGCGGCCCAG3TAGAGCCGCAAACCATTCGCTTCGTCGAGGCCCACGGG 

ACCGGCACACTCGCCGGCGATCCGATCGAGGTAGAGGCGTTGACGGAGGTCTTTGCCGAAGCG 

GGTCGCGGTACCTGCGCCCTGGGTrCGGTGAAGACCAACATCGGCCACTTGGATGTGGCGGCG 

GGCGTGGCCGGTTTCATCAAGGCGGTCTTGGCGCTCGAGCGGCGCGTCCTCCCGCCCAGCCTT 

CACTTCGTCCGGCCCAACCCGGCCATCGATTTCAACGGGCCCTTCTACGTTTGTCGCCAAATC 

GAGCGGTTGACGGAGAACGGGCGG7TGCGGGCCGGGGTGAGTTCCTTTGGCATTGGCGGCACC 

AATGCCCACGTGATTCTGGAGGAAC-CGCCGGCGCCGGAGGCGAGACTGCCGGCCGGGAGCCCG 

CCAGGCGCGAGTCCGTTCCTGTTCCCGCTATCGGCCAAGACGCCGGATGCGCTGGCAGGCCGT 

TGCCACGACCTTGCCGACCACCTGCGGGCGCACCCCGAGCTCCTCCTGGCCGATGTGGCCCTC 

ACTCTGCAGATGGGGCGGGCGTCGTTCGCCTACCGCCATGTGGTCCAGGCTGCGACGGCGGAG 

GAGCTGATTCGCGGTCTGGGAGCGTTCCGACAGGAGTCCATCCGCAAGAGGCGGAATCGAGTA 

CAATGGGTGTTGGCAGGCGAGGCGATGTCGCTTGACGCCGGTTTGCGGCTGTACGCCGATTGG 

CCGGTCTATCGGGAGCGGGTCGACG7CTGTCTGGCGATCGTCGCCAAGCTGCGCCAAATCGAC 

GGCCGGTCATTCCTACATGAGTGGATCGAGCGACCGCGCGAGGTTCCTGCCGAATGGTCGACG 

GCGCTGGCGTTCATGTTCCACTGCGCGCTGGCGCAAGCCCTGAGCCAGGCCGGCCTGCACCCG 

CAGCGCATGTGGAGCCGTGGGCTGGGCGGACAGGTCGGCGTGGTTTTGGCCGAATCCCTGTCG 

TTGGAACAAGCGCTGGCGCTGGTG7TGTGCCAGACACCGGTTCCCGGCGATGCCACACCTCAG 

CGCGAACGCTTGGTTCGGACACTG3AAGGCTGCCGGTTTCGTCCACCACGATTTTTGATTTCG 

GCAGACAGCTCGGGTCGACCCCTGGACCTCGCCGAATTCGCTCATGTCGATTTTTGGTGCGGT 

GGCCAAAGCGCCTCGCCCAATGAGGCGGAGCTGCGCTCATGGAGCGACGCCGCGCCCGAGCTG 

GTGACCTTGGCGATCGGCCCATCCTTTCTCGAGGCCGCCTCCGGGACGGTGGGTCTGGCGATC 

GACCCCAAGCGACCGATGACCTGTGTTCAGCGCACGGTGGCCGCGTTGTGGGAATGGGGATGT 

GACGTGCGCTGGGCTGCGTTCACC7CGTCGACCGGGCGTCGGGTTCCCCTGCCTACCTATCCC 

TTCGTGCGGGTAATTCCCACGATC3GCGACCCCCTTCGCGGAGCAGGCGCGGAGGATGACTTG 

ATTGCGGCGAGCGCTTCCGCGTCG3CCGGATCGCCGCCCGAGCCGTCGGCAAACTCGGCAGCG 
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GAACGCCCACGCGCCCAGTCAAGCATCGCCTCGGCAACCACACCGGCTCCGTCTCATACGTCG 

GCCAGCGTGGCCGTGGCCACCATTCTCGAAACCGTCCGTGCCTATTTCGGGTTCGCCGCCGTG 

CGTTCCACCGACGCCTTCTTCGAATTGGGCGCGTCCTCGCTGGATTTGGTCAACCTGGGCCAG 

CTCCTTTCCGATCGTCTCGGCCGCGAGGTTCCGACCCTGCTCCTCTACGACCACCCAACACCG 

GACCAGTTGGCGCTGGCCCTGACATCCGCGGCGCTCAGCGCAGAGGCGCCGCCCTTAAGGGGC 

GGTCATCGCGCATCGACTTCCGGCACAGCCGCGAGCTCGGCCGCCTCCACCGCACCGACGTTC 

CCGGGGGACGCTCACTCGCAGCCCAGCTTCGTTCGCGAGCAGGACATCGCCATCATCGGGATG 

GCCTTCCGGGGACCGGGCGCCGACGACCTGGACGCGTTCTGGAACAACCTGGTCGAAGGGGTC 

GAGTCGATCACCTTCTTCAGCGAGGACGAGCTGCTGGCGGCGGGCGTCCCCCGCGAACATCTG 

GCCTCGACGCGCTACGTGCGGGCCAAGGGGGAACTGACTGGGATGATGGATTTCGAACCGGAA 

TTTTTCGGTTATTCGGCGCGCGAGGCGGCGGTCATGGACCCGCAGTTCCGCGTGTTCCACGAA 

TGCTCCTGGCACGCACTGGAGCACGGCGGCTACGATCCGACCCGATGCGCGGCATCGATTGGC 

GTCTACGCCGGCGTGACCAACCACCTGCCTTGGCTGATGCGAACTTTGCCGCACGTGACCGAG 

GAGGAGCAATTCGGCGCGCTGCTCCTCACCGACCGCGAGTTTTTCGCACCGCTGCTCTCCTAC 

AAGGTCGGCCTGCGCGGACCCGCTATTTCGCTGCAAACCGCCTGTTCGACGTCGTTGGTGGCG 

ATCGGCACGGCCTGTCGCGAATTGCGCGCGGGTGCCTGTCAGATGGCCCTAGCGGGCGGCGTG 

ACGGCCAGCATCGAGCGCTGCGGCTACTTCCACCAAGAAGGCTACATCCTCTCGCCTGACGGC 

CACACGCGCAGCTTCGACGCGGCGGCCGCCGGCACGGTCTTCGGCGACGGAGTCGGCATGGTG 

CTGCTGAAGCCGCTGGCCCAAGCCTTGGCCGACGGCGACACGATCCACGCGGTGATCAAGGGA 

ATCGGCATCAACAACGACGGCGCGCGCAAGGTCGGCTTCACCGCACCTAGCCGGGCCGGTCAG 

ACCGAGGCGATTCGGGCCGCGCTGCGCGACGCCGGGGTGGCGTCGAACCGCGTCAGCTACGTG 

GAGGCGCATGGAACCGCGACCAGAATGGGCGACCCGATCGAGGTCGAGGCCTTGACCCAAGCC 

TTTCGCGCCGAAGCCGACGGTCCGCTTCCGCCCGGCTCCTGCCTACTCGGCTCGGTGAAGTCC 

AAC3TGGGCCACCTGAACGCCGCGGCCGGCGTGGCTGGTCTGGTAAAAACCGTGCTGGCGCTC 

CAACACCGCCGCCTGCCGACCAGCCTGTTCTACCAGTCGCCCAATCCACACATCGACTTTGCG 

GCGAGTCCGTTCCGCGTGAACGGCCAGACTTCGGATTC'GGTCGCGCCAGAGGGGACGCGGTTG 

CTGGCGGGAGTGAGTTCGTTCGGTATCGGGGGAACCAACGCCCACCTGATCGTCGAGGAGGCG 

CCGAAAGCGCTACCGACGACAGCG3CACCTCTGTCGACGGAGCCGAATGACCTCGACGCGGGC 

ai^.CGCCGACGGGCTAGTGCTGCCGATCTCGGCCCGCACGCCGACCGCCCTGGCGCACATCGCG 

ACCAACCTCGCCAATCACCTGGAACGACATCCGACCATCGCCCTGGCCGACGTCGCCCTGACC 
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CTTCAGCTGGGCCGTCGCCAATGGCCCCATCGCCACAGCCTGATCTGCCGGAATCGAACGGAG 

GCGATCAAGCTGCTGCGCGCCGTCGTCCACTCCGCGGAGGTGCCGCCAGCTCAGGCGCCGGTC 

TCGGATGCGCCGCGCTGTGTTTTTCTTTTTCCCGGCCAGGGCGCCCAATACCCGAGCATGGCC 

CGCGACCTGGTTCGAAACTGTCCCGACTTCGCCCTGCACCTGGACCCCTGCCTCGACCAGTTG 

GCCGAACTGCTTCCCGAAGATCCGCGTTGCATCCTGTTCGGCGATGGCCCCGCCGATCGGCTC 

GACCAGACGGCCTACACTCAGCCGCTGCTCTTCTCCCTGTCCTACGCCTTGGCGCGCTGGTTG 

GGCGATTTCGGCATTCGCCCCGATGCGATGATCGGCCACAGCCTGGGCGAATACGTGGCGGCC 

TGCTTGGCCGGGCTTTTCTCGCTGAGCGATGCCCTGCTGCTGGTGAGTGAACGCGGCCGCCTG 

ATGGGCTCGGCCGCGCGCGGAGCGATGCTGGCCGTCCCCTTGCCCGAATGGGAACTGGAGGAA 

CGCCTGGAGCTTCTGGCCGACGACCGAATCAGCATCGCGGCGGTCAACACCGCCGAGAGCTGC 

GTCATCGCGGGACCCAGCGAGGCGATCGAGCGCTGCGCCCAGCGCTGGGCCGCGCAAGGCCTG 

ACCTGTACGCCGCTGCGCACGTCCCACGCCTTCCACTCCGCGATGATGGAGCCGATTGTCGAA 

CCCTTCGGCCATGTCTTGGCACGGGTCACCTTCGCGCCGCCGCGCGCGCGCTGGATCTCGAAC 

CTCGACGGCAAGCCGATCGATTCCGCGGCGGTGATGCAGCCCGACTATTGGGTGCGCCACCTG 

CGCCAACCGGTCCGCTTTCACGAGGGACTCAGTCACCTGTTGGCCGAGGACACCCATGCTTGG 

GTCGAAGTGGGTCCCGGCCGAACCCTGTCCTCCTTCGTCCGCCGCCACCCGGCCTACCGTCAC 

CAGCCAATCGTCAACCCCATGCGCCATGCAGTCGAGTCGACGGGCGACGTGCGCCGGTGGCGC 

CAAGCGCTGGGCGAACTATGGCGGGCCGGCATGCCGGTCGCCTGGGAGCGGCAGCGGCGCGGC 

CGGCATGCCGGACGACGTGTGCCGCTGCCGGGCTACCCCTTCGAGCGGCGGCCCTTCGCGGCC 

CGAAGACCGGTGGAGCTGGCGCAGCCCGCGCCCAAGGCGGAGCTGGTGAAAAACCCCGATCCC 

GCGCGGTGGCTGTACCGCCGCGTCTGGGGCCCTGCCCAGGCTGCGGCCGGCGGACTGGCGGTG 

CAGGCGACCGTTCTGGTCTTCGGCGACGGGTCCGAGCTGTGCCGCGCGGCGGTCGCTCAGGTG 

CAGCGCCAGGGGCTGAAGTGCGTCTCGATCACCGCGGGCCGCCAATTCGCGCGGGAGAGCGAC 

ATGrGCTTCACGCTTGACCCCGCTGATCCGCGCCAGCTCGACCAGCTCTTCGCGGCCCTCGAT 

GGCTCAGGCTCGCGGCCGCGGTACGTCCTGCACCTGCTGACCCTGAACCCGCCCCCGGATGCC 

TCG3CGATCATCGCTCACAGCTACTACAGCCCGATGGCCTTGGCTCATGCCTTGGGCGCCCAC 

GAC-ATCGCGCCTGTCTCGATCACCGTCGTCACCGCCGGGGTCGTCGCCGTCGCGGACGAAGCG 

AT7CGCGAGCCGCTGCAGGCGCTGATCGTGGGCCCGTGCCTGGTCATCCCGCAGGAGTTTCCC 

GGGCTCAGCGTTCGGCTGCTGGACGTCAACGTCGACGATCCGGCACCGCGTCTGGCGGAGCGG 

CTC3TGGCCGAGCTCTCGGGCACGGATCACATGGTGGCGCTGCGCGGCGGCGAGCGCCTAGTG 
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GGCCACTACCTGATTCTCGGCGGCCTGGGCGATATCGGCTACCACTGTGCCCGCTATCTGGCC 
CAAACCTACCGCGCCAAGCTGACGCTGACCGCGCGTTCGTCACTCCCGCCGCGCGCGTCGTGG 
GAGCGAATGCTGCGCGAGGGAAACCTGGATTCCCGGCAGCGCACGCGCATCGAGCGCGTGTTG 
TCGCTAGAGGCGTGCGGGGCCGAAGTCCAGACGGCTGCGGTCGACTTGGGCGATCGCCATCGC 
TTGGCCGATGTGTTCCGCGAAGCACGGGGCCGATTCGGCGCCATCGCGGGCGTGATTCACTCG 
GCGGGGATTCCGGGACACGTCCACTCGATCGACGAGCTGGTGCGCGTCCGCGACGAAGCCCAA 
TTCACCGCGAAGGTTCGAGGGCTGCACCACCTGGCCGAGGTCGTCGATCCGCTGAACCTCGAC 
TTTTGTCTGCTGTTCTCCTCGCTCTCGACCGTCGTCGGCGGGCTCGGCTACGGCGCCTATGCA 
GCGGCCAACGCCTACATGGACAGCTTCGCCCGCCGCCACGATCGGCCGGACGAATGTCGTTGG 
ATCGCGGTCAACTGGGACGCCTGGCTGTTCGAAGCCAAGACGTCGTCGGTCGGCGCCGAATTG 
GCGCGCCTGGCGATCGTGCCCGAGGACGCTCCGGCCCTGTTCGCGCGGGTGCTAGAGCGACTT 
CCGCAATCGTTCATCGTGTCCACCGCCGACCTTCGGGCCCGCATCGACACTTGGATCCGGGAC 
AAGAACCGCGTCCCGCCCGCCGAGATCCGAGCGGTTCAACCGCGACCGGACCTGAGCCAGGCG 
TACGCCCCGCCGATCGGCCCGCTGGAGATTCAACTCTGCGGGCTGGTCTCCGCCTATTGCCGG 
TTCGACCGGATCGGGCGGGACGATTCCTTCTTCGAAATCGGCCTCAGCTCGTTCGACTTGATC 
CAGCTCAGCTCGCGCATTCACCGCATCACCGGCAAGGATCTCAATACGACCCAACTGTTCAGC 
TACCCCACCGTGCGCGCCTTGGCGCTCTTCCTCGGCGGCGAACCGGAGGGGCTCGCGGCGGAG 
GAGCCCGCCATGGAGAACCTGTGGCTGCAACGAAGCGATGCGACCCTCGATGAG 

(2) peptide sequence 

Seq ID No 64 ( >pEPOcos6_ORF12 . pep) 

MTVEHETGFEIAVIGLACRVPGAADVAAFWRNLVEAKESVRFFEDHELRAAGVPEEILRLPNY 
VKAKPLLADGEAFDADFFGFHPREAAYU)PQVRLLHECCWTALEDAGYDPAQYAYPIGLFAGV 
SSNLSFLFDRIDPRDSPLQKRYVAELNAASFATQIAYRLDLKGPAISICTACSTSLVAIHLAA 
QSLIGGECHMALAGGATLEVPKKPGYLYREGYINSPDGHCRAFDADAAGTIFGDGVGIVLLKR 
YRDALRDGDHVYAVI KGS AI NSDGHRKVS YTAPGKSGQVA VI RAALAAAQVE PQT I R FVEAHG 
TGTLAGDPIEVEALTEVFAEAGRGTCALGSVKTNIGHLDVAAGVAGFIKAVLALERRVLPPSL 
HFVRPNPAIDFTJGPFYVCRQIERLTENGRLRAGVSSFGIGGTNAHVILEEAPAPEARLPAGSP 
PGASPFLFPLSAKTPDALAGRCHDLADHLRAHPELLLADVALTLQMGRASFAYRHWQAATAE 
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ELIRGLGAFRQESIRKRRNRVQWVLAGEAMSLDAGLRLYADWPVYRERVDVCLAIVAKLRQID 
GRSFLHEWIERPREVPAEWSTALArMFHCALAQALSQAGLHPQRMWSRGLGGQVGWLAESLS 
LEQALALVLCQTPVPGDATPQRERIVRTLEGCRFRPPRFLISADSSGRPLDLAEFAHVDFWCG 
GQSASPNEAELRSWSDAAPELVTL^IGPSFLEAASGTVGLAIDPKRPMTCVQRTVAALWEWGC 
DVRWAAFTSSTGRRVPLPTYPFVRVIPTIGDPLRGAGAEDDLIAASASASAGSPPEPSANSAA 
ERPRAQSSIASATTPAPSHTSASVAVATTLETVRAYFGFAAVRSTDAFFELGASSLDLVNLGQ 
LLSDRLGREVPTLLLYDHPTPDQLALALTSAALSAEAPPLRGGHRASTSGTAASSAASTAPTF 
PGDAHSQPSFVREQDIAIIGMAFRGPGADDLDAFWNNLVEGVESITFFSEDELLAAGVPREHL 
ASTRYVRAKGELTGMMDFEPEFFGVSAREAAVMDPQFRVFHECSWH/y^EHGGYDPTRCAASIG 
VYAGVTNHLPWLMRTLPKLTEEEQFGALLLTDREFPAPLLSYKVGLRGPAISLQTACSTSLVA 
IGTACRELRAGACQMALAGGVTASZERCGYFHQEGYILSPDGHTRSFDAAAAGTVFGDGVGMV 
LLKPl^QAIJ^GDTIHAVIKGIGIXTOGARKVGFTAPSRAGQTEAIRAALRDAGVASNRVSYV 
EAHGTATRMGDPIEVEALTQAFRAEADGPLPPGSCLLGSVKSNVGHLNAAAGVAGLVKTVLAL 
QHRRLPTSLFYQSPNPHIDFAASPFRVNGQTSDWVAPEGTRLLAGVSSFGIGGTNAHLIVEEA 
PKALPTTAAPLSTEPNDLDAGDADGLVLPISARTPTAI^HIATNLANHLERHPTIAIaADVALT 
LQLGRRQWPHRHSLICRNRTEAIKLLRAWHSAEVPPAQAPVSDAPRCVFLFPGQGAQYPSMA 
RDLVRNCPDFALHLDPCLDQLAEll.PEDPRCILFGDGPADRLDQTAYTQPLLFSVSYALARWL 
GDFGIRPDAMIGHSLGEYVAACLAGLFSLSDALLLVSERGRLMGSAARGAMIiAVPLPEWELEE 

rleliaddrisiaavntajsscviagpseaiercaqrwaaqgltctplrtshafhsammepive 
pfg:-:vlarvtfapprarwisnldg:<?idsaavmcpdywvrhlrqpvrfheglshllaedthaw 
vevgpgrtlssfvrrhpayrhqpi-.^pmrhavestgdvrrwrqalgelwragmpvawerqrrg 
rhagrrvplpgypferr?faarrf\^laqpapkaelvkn?dparwlyrrvwrpaqaaagglav 

QATVLVFGDGSELCRAAVAQVQRQGLKCVSITAGRQFARESDMRFTLDPADPRQLDQLFAALD 

gsgsrpryvlhlltlnpppdasa: : ahsyyspmalahalgaheiapvs itwtagwavadea 
irh?lqal:vgpclvipqefpgls^?.lldvnvddpaprlaerlvaelsgtdh^fvalrggerlv 
advdqvdglgvgiakvplrreghyiilgglgdigyhcarylaqtyrakltltarsslpprasw 

ERiVLREGNLDSRQRTRI ERVLSL3:ACGAEVQTAAVDLGDRHRLADVFREARGRFGAI AGVI HS 
AG: PGHVHSIDEL\niVRDEAQFTA:<VRGLHHIAEVVDPLNLDFCLLFSSLSTVLGGLG^^^ 
AA::AYt©SFARRHDRPDECRWIAv::CWDAWLFEAKTSSVGAELAR]^IVPEDAPALFARVLERL 
PQSFIVSTADLRARIDTWIRDKNr.VPPAEIRAVQPRPDLSQAYAPPIGPLEIQLCGLVSAYCR 
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FDRIGRDDSFFEIGLSSFDLIQLSSRIHRITGKDLOTTQLFSYPTVRALALFLGGEPEGLAAE 
E PAMENLWLQRS DATLDE 

pEP0Gos6_0RF13 sequences : 

(1) nucleotide sequence 

Seq ID No 65 ( >pEPOcos6_ORF13 . seq) 

ATGAAATACGAAACCACCGGATTGGAATTGGCCGTCATCGGTCTCGCTTGCCGCTTTCCAGGC 
TCACCCGATCCCGAACAGTTCTGGTCGAATCTGCGCGCAGGTCGCTCCGGAATCCGCCATTTC 
AGCGATGCCGAGCTGAGCCACATCCCCGCATCCCTGCGTCACCATCCGCATTACGTCAAGGCC 
AAAGGCGCGCTGGACCACGCCGATTTCGAACCAGCCTTCTTCGGCTACTCGCCCAAAGAGGCC 
GAGGTGATGGACCCTCAATTCCGGCTGCTCCATGAGTGCTGCTGGGAGGCGCTGGAGTCAGGC 
GGCTATGCGCCGAGCCAATTCGCGGGTCGGATCGGCTTGTTCGCGGCGGCGGCCTTCAACGAC 
GGATGGATGGCCGGTACCCTCGACCGGCTGCGCACCGGCGTGGGTTTGAGCTCCCTGGAAACC 
GCGTTCTTGACCCTGCGCGATTACCTGACCACCCAGATCTCCTATCGGCTCGATCTGCGGGGC 
CCCAGCCTGCTTGTCCAAACCGCCTGCTCGTCGTCGCTGGTGGCGGTCCAGCTCGCCCAGCAG 
GCGCTGATCTCCGGCGAATGCGCCCTGGCCTTGGCTGGCGGCGTGTGCGCGACCGATCCGCTG 
CATTCGGGATACCTCTATGAACCCGGCAACATCTACGCGCGCGACGGCGTCTGCCGACCGTTC 
GACGAGGCAGGCGCCGGTACGGTCTTCGGCGACGGGTGCGGCATGGTCCTGCTCAAGCGGCTG 
AGCGACGCCCAGCGCGACGGCGATACGATCTGGGCGGTCATTCGCGGGGCGGGCGTGAACAAC 
GACGGGCACCACAAGGTTGGCTACACGGCTCCTGGCACGAGGGGCCAGGTGGCTTTGCTTAAA 
AGTGTTTATCGCGCGAGCCGGGTCGACCCGGCGACGCTCGGCTACCTGGAGGCCCATGGCACC 
GGCACCGCGCTCGGCGATCCAATCGAGGTCGAGGCGCTTACCCAGGCCTTCGCCAGCAAACGT 
CGC3GCACCTGCGGCTTGGGCTCGGTCAAGGGCAACCTGGGTCACCTCAACACGGCGGCCGGC 
ATCGCTGGACTGATCAAGGTGGTGCTGGCGCTGAAACATCGCGAAGTGCCACCCACCCTCAAT 
CTGCGCCGTCCCAATCCGAAAATCCGCTTCGACGAGACGCCGTTTTTCCCAGTCGTCGAGTTG 
CAACCCTGGCCAAGCGGGACCGGCCCCTTGCGAGCCGGCGTGAGCTCCTTCGGCATCGGCGGT 
ACGAACGCCCACGTCATCCTCGAGGAGGCACCGCCGACGGCCAACCCGGCGCCACACGGCAGA 
TTCCGACTGTTGCCGCTTTCGGCCAAGACACCGGCTGCGCTCGAAGCGAAGCGCCGCGATCTG 
GCCGGCTTCCTCGAACGCCACCCGGAGACCTCCTTGGCCGACCTCGCCTTTACCCTGCAACGC 
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GGCCGCGAGGTCTTCAGTCACCGCGCCTGCCTCGCCGTGGAGACCTTAACGTCCGCGCGCACG 
CGGCTGAGCGGCGAGTCGTCGAGCACTTGCGTGGTGGGCCCCGCGCCCAGCGCCATATTTCTG 
TTCCCTGGTCAAGGCAGCCAGCTCGCCGGGATGGGCCGCGGTCTGTATCACCATTTCGAGCCG 
TTCCGCACGGCCGTCGATGCCTGTCTGCGCGAGCTGGAGCCAGGACTGCGGCAAGCGCTCAGC 
GCCCATTTCGATCCGAATCGCGGCGCGGACCCACCCGATTCGACGACCTTCGTCCAACCCTTG 
TTGTTCCTCGTCGAGTACGGGGTGACCGAGTGGCTACGCTGCTTGGGTGTGCGGCCAACAATG 
GTGTTGGGTCACAGCTCTGGCGAGTATGCCGCAGCCTGCGTCGCGGGCGTTCTGTCGCCGTCC 
GCGGCGGTCTCGCTGCTGGCCGAGCGCGAGCGGCTGCTGCGCGACCTGCCAGCCGGCGCCATG 
CTCGGCGTCCCGCTGGCCGCCGAGGCGCTCGAGGCGATGTTGCCCGACGCTCTCGATCTGGCG 
GCGATCAACGGCTGTCAGCTTTGCGCCGTGTCCGGGCCGGTCGCGGCGGTCCACGCCTTCAAG 
GCCCAACTGGAAGCCGCCGGACATCACGCCCGCCTGTTGCACACCGATCGCGCCTTCCACTCG 
CGGCTGGTAGCACCGGTGCTTGACCGGTTCCAGGCAGCCGTTCAACACGTGGAGC7GCGGCGG 
CCGCAAGTACCTTACCTCTCGACCGTCAGCGGGCGATTGGAGGCGGATGGGCCGGCGAACCCG 
CACTACTGGGTGCGTCACCTGCGCGACACGGTGCGGTTTGGTCCAGCCCTGGAGGCGCTGCCG 
CCGGTGGATTCCTTCGTGTGCATCGAGGTGGGACCAGGCTCGGCCTTGAGCACCATGGCGCGC 
GAAACGTTGGGTTCCCAGGCGCGACTGATTTCGTTGCTGCCGCGGCCGCGAACGGGGCAAATC 
GAGCCCGGTCCGGTATTCGAACGACTGGCGGCGCTTTGGCGCAGCGGGTTGACATTGGATTGG 
TCTAAATTGACGGGCGGCGAAGAGGGTCATCGAATTCCCTTGCCAGTCTACCCGTTTCAGCGC 
AGCCATCTGTCGAGCTCCCTGGCGGCGGGCCACACGCCTTCGTCGCGGCCTGCAGTCGAATCA 
GGCGCCATCCTTGCCGAGCGATCCGCAGGGGAAAACGCTGAAACCCGGGATTGCCCGCTGCCA 
ACCGCCACGCTCGAGCCCAAGGCGGTCGCTCCGGCCCCACTCGAGGCTACCGACGCCGCAGGT 
ACTCGCGAGCGACTGGCCGAACTTTGGCGCGAGTTGCTAGGGTTGACCTCGATTGGGCCCGAC 
GACCATTTCTTCGACCTGGGCGGCCACTCGCTGACCGCCACGCGGCTGCGCGCCCTGATTCAC 
CAGCGGTTCGATGTCGATCTCGGGCTCGACGAAATCTTCGCTCATTCGCGTCTCTCCCAGCTG 
GCCGCCCGTATCGAGGCGGCGGCCAAGAGCCGATTTTCCTCCATTCCCAGCGCGCCGGACCAG 
GACGACTATCCCTTGTCATCCGCCCAGCAGCGGATTCACAGCATCGTCACGAGGGCCGAGGTC 
GGCACTGCTTATAATTTTCCGATCGTCCTCGAGCTGCAGGGCGCTCTGGATCGAGTGCGATTC 
GAGGCGACGTTCGCGGCATTGTTCCGGCGTCATGAGGGGTTCCGCACCCGCTTTGTGATGCGC 
GATGGCGC-GCCGCGCCAGCGCATTGTACCGGACGTGGCGTTTCGCCTGCCGCTCACCCAGGTC 
GAGCCAGAGCAGGTTCCCGGGCGCATCGAGGCCTTCATCCGTCCCTTCGATTTGGAACGCGCG 
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CCGCTGTTCCGCGCGGAGCTGTTGCAGTTGGCCGAGCAGCGCCATCTGCTACTTTTCGACATG 

CACAACTTAATTGCCGACGGTATCTCGCTCAACCTGTTCGTCGCCGATTTCGCGGCCCTGTAC 

CATGGTCGTCCGCTGGCGCCGCTGAAACTCCGCTATCGCGACTATGCCGTTTGGCAAGAGGCG 

CGGCTGGCCTCCGATGACCTGCGCA3CCAGCGCGAATGGTGGCACCGGCGGCTTTCGCCGCCG 

GTCGCCACGCTGGCGCTCCCTCCCGATTTCCCGCGTCCGGCGGTGCGCCGCTACAAGGGCCGT 

AATGTGGTGTTCCACCTGGACCGGGAGATCCGCGACCGCCTGGTGGCCCTGGCTCGAACCCAG 

GGGGTCACCATGAACGTGATGATGCTGGCGCTCTGGGCTGCGCTGCTGCATCGCGAAACCGGC 

CAATCGGAGCTGGTGGTCGGATCGCTGCTCGGCGGGCGGCCGCACAGCGAGCTGCATCCCGTG 

ATCGGGCTCTTCACCAACTTTTTGCCCTTGCGGTTGGCGGTCGAGGGATCGACCCGCTTCGAT 

CGCTTCCTTGCCGCTTGCCACCAGGTGTTTCTCGAAGCCTATCAGCGCCAGGACTATCCGTTC 

CACTTGTTAGTCCAGGAACTCGTGCCGGTCAGGGACCCGTCGCGGTCGCCGCTGTTCCAGACC 

TCGCTCGTCTACCACAACGAAATTGACGGCAAGACCAAGCTGGAATTGGAAGGGCTGAAAGTC 

GAAGTGGTTCCCTTCGAAAAGGGTGTGGCGAGGCTGGATTTGAAGCTGGATGTGACACCTTTT 

TCCGACCGACTCGAATGTGTTTTGCAATACGACTTGGATCTGTTCTGCGAGGAGACGATGCGC 

GGCCTGATCGCGCGGTTCCAGGCGTTGGTGGCGGGGCTTGTCGCCGATCCGGCGCAATCGCTC 

GCCGCCGCGAGCGTTTCCGGGAAGCGGGCGCTGCGCGCGGGCGTGGCCACGGCAAGCGAATCG 

TCGCCGCAGTCACTGCCGCCGCAACCATCGACGGCGTACGCCACTCCCTCACCGCAGTCACCG 

TCGCCGGTAGTCCTGACGGGACCCGCCGACCTGCCCGCGATCTTGGCGGCCTACGTGGGGCAG 

AACCCCCATCCGTTCGCGATCCATCGGGGTCTCATTTTGGAGGCGCCGCTGGGGTTGCGAGCG 

CTGCGGTCGGCGCTGGACGCAGTGCTCGGAGAACACACCCATTGGCGCAGCGTGCGTGCGGGC 

GATCGCGCGCGGCGCGTGGATAAGTTGGAATTGACCAGCCTGGTGCGGCTCGACGACCTGCGC 

GGGTTGGTCAATCCTCAGGCGAATGCCTTCACCCTGGCTTGGCGCGATCTGGCGATGCCGTTC 

GGGGAGGGGCGTCCCCTGTGGCGACTCCGCCTGGCGTGGTCGGCTCCATCGCGCTGGTTGCTA 

TTGCTGACGGTTCATCCATTGATCGGCGACAACGGCACGGTCGACCTCTTTCTGGCGGCACTC 

GCCGATCACCTGCGCCGCGCGTCCGCTTTTCCCGTAGCACCGCTCGATGAGGCCGAGCTGGAG 

GCCGAGCTGAAGTGGGGAGAGGAAGGGGAGGGCCTCGGGCTGACCGCGATCGCGCCGGTCCTG 

GGCCAATTGCGCGAAAGTCGGCTGAGTCCTGTGGCCCAGATGTGGCTGGACGAGGtCTGTCGC 

CGCCACGACCTCACCCCGCTAGAGGTCTTGGCGGCCCGGCTCCTCGATTGGACACGAAGCCAC 

GGTCACGGGTCGATCGCTTTGTGGACGCCGCTGCCCGAGGACCATCCGCTTCGCGATGAAGGC 

CGCTGCCTCCAGGTTCGCCTGCTGGAGGGGCCGCCGTCGCAGCGAGGAGCGGGCGATCCAAGC 
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TGGCTCGAGCAAATCGCCTTGAGACGGGGTACCCCTGCAACGGAGGTCGTTTGCCCTACTCCG 
ACCCAACGGGCAGCCATCGACCTCGCGCTGGCCTGGCTGCCGCAGCCGCCTCTTCACGGTTTG 
GTCGGAACCGTTCAGCCGTGGCCGGAATCTCCATTGGTCTGTCCGTTTCCCCTCAATCTCGCG 
TTCCGGCCAAGCCATCCAATTGCCTACGCGCTCAAGCACGAGGCCACGCTCGCGGTCACGGCA 
CGGGCGCGCGATCTGATGCGTTTCCTCGACGGCTTGGGCCCGGAAAGC 

(2) peptide sequence 

Seq ID No 66 ( >pEPOcos6_ORF13 . pep) 

MKYETTGLELAVIGLACRFPGSPDPEQFWSNLRAGRSGIRHFSDAELSHIPASLRHHPHYVKA 

KGALDHADFEPAFFGYSPKEAE^/MDPQFRLLHECCWEALESGGYAPSQFAGRIGLFAAAAFND 

GWIAGTLDRLRTGVGLSSLETAFLTLRDYLTTQISYRLDLRGPSLLVQTACSSSLVAVQLAQQ 

ALISGECTUiAIAGGVCATDPLHSGYLYEPGNIYARDGVCRPFDEAGAGTVFGDGCGMVLLKRL 

SDAQRDGDTIWAVIRGAGVNNDGHHKVGYTAPGTRGQVALLKSVYRASRVDPATLGYLEAHGT 

GTALGD P I EVEALTQAFAS KRRGTCGLGS VKGNLGHLNTAAG I AGL X KWLALKHREVP PTLN 

LRRPNPKIRFDETPFFPWELQPWPSGTGPLRAGVSSFGIGGTNAHVILEEAPPTANPAPHGR 

FRLLPLSAKTPAALEAKRRDLAGFLSRHPETSLADLAFTLQRGREVFSHRACLAVETLTSART 

RLSGESSSTCWGPAPSAIFLFPGOGSQIiAGMGRGLYHHFEPFRTATOACLRELEPGLRQALS 

AHFDPNRGADPPDSTTFVQPLLFLVEYGVTEWLRCLGVRPTMVLGHSSGEYAAACVAGVLSPS 

AAVSLLAERERLLRDLPAGAMLGVPUU^EALEAMLPDALDliAAINGCQLCAVSGPVAAVHAFK 

AQLSAAGHHARLLHTDRAFHSRLVAPVLDRFQAAVQHVELRRPQVPYLSTVSGRLEADGPANP 

HYWVRHLRDTVRFGPALEALPPVDSFVCIEVGPGSALSTMARETLGSQARLISLLPRPRTGOI 

EPGPVFERLAALWRSGLTLDWSKLTGGEEGHRIPLPVYPFQRSHLSSSLAAGHTPSSRPAVES 

GAILAERSAGENAETRDCPLPTATLEPKAVAPAPLEATDAAGTRERLAELWRELLGLTSTGPD 

DHFFDLGGHSLTATRLRALIHQRFDVDLGLDEI FAHSRLSQLAARI EAAAKSRFS S I PSAPDQ 

DDYPLSSAQQRIHSIVTRAEVGTAYNFPIVLELQGALDRVRFEATFAALFRRHEGFRTRFVMR 

DGGPRQRIVPDVAFRLPLTQVEPEQVPGRIEAFIRPFDLERAPLFRAELLQLAEQRHLLLFDM 

HNLIADGISLNLFVADFAALYHGRPLAPLKLRYRDYAVWOEARLASDDLRSQREWWHRRLSPP 

VATIALPPDFPRPAVRRYKGR^^A/FHLDREIRDRLVAIARTQGVTMNV^4M1J^LW 

QSELWGSLLGGRPHSELHPVIGLFTNFLPLRLA^/EGSTRFDRFLAACHQVFLEAYQRQDYPF 

HLLVQELVPVRDPSRSPLFQTSLVT^HNEITCKTKLELEGLKVEWPFEKGVARIJDLKLP 
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SDRLECVLQYDLDLFCEETMRGLIARFQALVAGLVADPAQSLAAASVSGKRALRAGVATASES 

SPQSLPPQPSTAYATPSPQSPSPWLTGPADLPAILAAYVGQNPHPFAIHRGLILEAPLGLRA 

LRSAIXIAVLGEHTI'IWRSVRAGDRARRVDKLELTSLVRLDDLRGLVNPQANAFTLAWRDLAMPF 

GEGRPLWRLRIAWSAPSRWLLLLTVHPLIGDNGTVDLFLAALADKLRRASAFPVAPLDEAELE 

AELKWGEEGEGLGLTAIAPVLGOLRESRLSPVAQrWU)EVCRRHDLTPLEVLAARLLDWTRSH 

GHGSIALWTPbPEDHPLRDEGRCLQVRLLEGPPSQRGAGDPSWLEQIALRRGTPATEWCPTP 

TQRAAIDLAIAWLPQPPLHGLVGTVQPWPESPLVCPFPLNLAFRPSHPIAYALKHEATLA^TTA 
RARDLMRFLDGLGPES 

pEPOcos6_ORF13 . 1 sequences: 

(1) nucleotide sequence 

Seq ID No 67 (>pEPOcos6_ORF13 . 1 . seq) 

ATGACGCAAGCCTCGGCCGCGTCGACGTCCCAGGTCGCGCCGGAGGTCACCCCCGGCCGAAAG 
GACGACGATGACGATCAAATCCGAGATGTCGGCCGTTGCTCACTCTGCGGAGAGCGGCTTCCG 
CGCTGGGCCACGCGTGGGCGGCGCGATGAAGCGGGGCCGGACGCCGGAGCAGGCCGGCGTGAA 
GCTGCTCCGCGCCCCGGTGAAGCGGAAGTGGCTGCCCCCGGCGCCCGTCCTGCGCCTGAGCGA 
GCGGCGTATCCCGGAGGTGTGGGCAGGCTACCGCGCGAGCGCGGGATGACCCGAGCCCCGCCC 
GCCGGCGCGACCATGACGCCGCCCCACGGGGCGAGTCGTCCGGCGCGCCGGCGCGCGTCGGGG 
CTTCCGCCGCCGGGCGGGCAGGTGCAGGATGGTCGGGCATGG 

(2) peptide sequence 

Seq ID No 68 ( >pEPOcos6_ORF13 . 1 . pep) 

MTQASAASTSQVAPEVTPGRKDDDDDQIRDVGRCSLCGERLPRWATRGRRDEAGPDAGAGRRE 

AAPRPGEAEVAAPGARPAPERAAYPGGVGRLPRERGMTRAPPAGATMTPPHGASRPARRRASG 
LPPPGGQVQDGRAW 
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pEPOcos6_ORF14 sequences: 

(1) nucleotide sequence 

Seq ID No 6 9 ( >pEPOcos6_ORF14 . seq) 

ATGGTGACGCGTCCGACGTCCGACGGCATCGAGGACGAGCTCGCGCCGTTCCCCCCGGTCCTG 
CGCGGCTGGCTCATCGAGGGCGAGCTCGGCCGCGGCGGGATGGGGCGGGTGTTCCGGGCGCGG 
CACCCGAAGACGCGGGCGCGGGCGGCGATCAAGGTGCTGCTCGGCGACTACGCCCGCCGGCCG 
GACGTGGTGGCCCGCTTCCGGCAGGAGGCGATCGCCGTCAACATCATCAACCACCCGGGAATC 
GTCCGCGTCTTCGACTCCGGCGAGCTCGAGGACGGCTCGCCCTACATCGTGATGGAGTACCTG 
GACGGCCGGGGGCTGCGCGACTGGGTGCAGGCCGTGCCGCCCGCGGAGCGGCCGCGGCAGGTC 
GTGCGGCTCGGCTACCAGATCGCCTCGGCCATGGCCGCGGCGCACGCGTCCAAGGTCGTCCAC 
CGCGATCTGAAGCCGGAGAACATCATGGTGGTCGAGGACGAGCTCGCGCCCGGGGGCAGCCGC 
GTCAAGATCCTCGATTTCGGCATCGCGAAGGTCCTCTGGGGAGGTCTGCCCGAGGTGCTGGAG 
CTCGAGGGGAGAGGCTCCCTCGCGCCCGCGTCCGCGTCCACGATCCGCACCGAGCTCTCGACG 
CGGCCGGCGCCGACGGTGGGCGCCACGACCGGCCCAGAGAGCCCGCTGGGCGCGAGCGCCACG 
CCAGAGAGCGCCCTGGGCGCGAGC3CCACGCCAGAGAGCGCCCTGGGCGCGAGCGCCACGCCA 
GAGAGCGAGGCCCACGAGGAAGACGCGCTCCGGAGCCTCCCCGTCGTGACCAGCGGCAGGCCC 
GCGATCCACCCCGCGCCGGTCGAGATCCCGCCCGAGGCGGTCTCCTCCGCGGCGTCGCGCGGG 
TCGCGCGCGTCGATCGAGCCAGGCGCGCCCGCGCCGCAGAGCGAGGGCGCGGGACAGCCCACG 
ATGCCGTTCACGCAAGAGGGCGTGTGGGGCCTCGGGACGAGGAGCTACATGGCGCCGGAGCAG 
GAGCGCCACTCCGGGAGCGTGGACGTGAAGGCGGATGTCTACTCGCTCGGCGTCATCCTCTAT 
GAGCTGCTCGAGGGGCGGACGCCCGACGCGCCGAGCGCCGCGTGGCCGCCCCCGATGAGCGCC 
GCCACGCCGCCCGATCTCGTCGCCCTCGTCCACCGGGTTCTGGCGTTCGATCCCGATGCGCGG 
CCGCGCATGGCGGAGGTGGCGAGCGCGCTTCACCGGCTCGGCCGGGCGAAGAAGGAGCTCGAC 
GAGGCGCTCTCGAGGTGGGTCGTCGGCGGAGGGGCGCCGGGGCTCTTGCCGTGCGGCTATGCT 
CT7CTCGAACTGGTCCTCCTGGGCCCTGGGAACTTATACGATTCTTTCCAGCCTGTAAGTGCA 
TT7TTCTTTCAATATCGTCCTCTCTTCATATACGAGGTGAGTTCTCTGAGGTCCTCCTATAAG 
TCTGGGGTGTCCTATTCGGCCTCTTACTTGTTACTTCGCCTTCTTAGGAGTTTTTCCTTAATT 
TTGCCCTCTTACATTCCCGTATTCATTCTAACTGGGCCCTATCTCATTCGC 
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(2)peptide sequence 

Seq TD No 70 { >pEPOcos6_ORF14 . pep) 

MVTRPTSDGIEDELAPFPPVLRGWLIEGELGRGGMGRVrRARHPKTRARAAIKVLLGDYARRP 

DWARFROEAIAVNIINHPGIVRVFDSGELEDGSPYIVMEYLDGRGLRDWVQAVPPAERPRQV 

VRLGYQIASAiyiAAAHASKVVHRDLKPENIMWEDElAPGGSRVKILDFGIAKVLWGGLPEV^ 

LEGRGSLAPASASTIRTELSTRPAPTVGATTGPESPLGASATPESALGASATPESALGASATP 

ESEAHEEDALRSLPWTSGRPAIHPAPVEIPPEAVSSAASRGSRASIEPGAPAPQSEGAGQPT 

MPFTQEGVWGLGTRSYMAPEQERHSGSVDVKADVYSLGVILYELLEGRTPDAPSAAWPPPMSA 

ATPPDLVALVHRVLAFDPDARPRMAEVASALHRLGRAKKELDEALSRWWGGGAPGLLPCGYA 

LLELVLLGPGNLYDSFQPVSAFFFQYRPLFIYZVSSLRSSYKSGVSYSASYLLLRLLRSFSLI 

LPSYI PVFI LTGPYLIR , 

or DNA sequences complementary to said open reading frames, 

(b) DNA- sequences which hybridise under stringent conditions 
to regions of DNA sequences according to (a) encoding proteins 
or to fragments of said DNA sequences, 

(c) DNA- sequences which hybridise to the DNA-sequences accord- 
ing to (a) and (b) because of a degeneration of the genetic 
code , 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isof unctional expression products. 

10. DNA sequence according to . any of claims 1 to 5, wherein 
the DNA is selected from the group consisting of 
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(a) Che following DNA Sequence: 

10 

Seq ID No 71 (>Contig4 3) 

CGGGTATTTGTGATATGTGGGCNGTAGTC3TATGCTTCATTAAGTACATC 
5 CGTCCGTNGTAGAGAGTGACTCTGTCGCAGCGATAATAGACACGCTTGTG 
15 ATGCTATAGGGAACATAGAGTCNTAGTAGATGATACGACGAGATATTNGT 
ATAGAGCGTATAGACCGACGTGTGAGCGTCATAAGTGTTGTGTGTCATGA 
GTGTGCTCAGAGGACGTGCAGACATTATATGAGCAGATGATGAGAGAGAA 
TCAATGCTGCAAGNTATTCGTCGAATCTACATTATATCGAATCGTGTATG 

20 

1 0 TGC3TTTGTCGCAGCGCGATNCGATGAGATACCGAAAGGGTATGTATCTA 
TNTTCGTGACGCTCGATNAGAGCAAATCCGCTACCGTGGAGATATCGTGT 
ATCGACTCCATCACGATCAGTATCATGATACGTCAAACGAGTACACTCAT 
25 TATTGATAACACACGTANGTGTGCATGCACAGTTATCGAGTGTATTGTGT 
GCATGAGAGGTATAGGATNTATAGGCGAGCATATATATCTATATATATAG 

1 5 GTTAAGAGTAGAANACTATGAAGATGCAGGAAGTAGTATCTCGCGGACAA 
ACGGNGTACCTAGCGGGGTTGAAGTATTATCGACAGTGTATAACGACTCA 
30 ACA3GGNTACGAGGTACATTGTATTTACAGTGGTTGGAAGGATTGCGCGA 
GGAAAGGTAGTGGTACCGTGTGAGCTACGATGCTCGGGATAATGGTGATT 
AGATAGAACCTTAGCGTTGCTAGATGAGTGAGTGGTGGTATGAGTAGAGT 

2 0 TTTTGTTCTAGCTTTGTGTCCAGCGAGGATTCGTTCAGTCTGAAGGGTAA 

35 

GAGTACGTCCATCGCACACCCGACCGTTTTGAGGAGTTCTCGGTGCGTGG 
TCAGTGGGGTTTGGAGAAGACAGAGTTGATTCATAGGGTTATCT^CGAG 
TTATGTGGATAGATGGTAGTGACCCCATTTGAGTGAGAGTGTTGGCGTTA 
^ ACANCAGCAGGATNTAT 
25 

SEC ID No 72 (>Contig4 4: 

TAG3rCTTTGACACCATGGGAGCTGCTACCGATGTTGCCGAGCACGATCG 
CGCZGGCGCCGACGAGCGACTGCAAGCCGGCCGCGGCCATTTACGCCTGA 
CGA3CGAGGTGGGCGAAGTGCTGGTGCGCGCCGTGCGTGTCGAGCGCGCC 
30 CAGGTCCGCCGTTGCGCCGTCGCC3AGCAGTAGCGCGCCGTCGAAGACGA 



55 
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TCACCGCGATCGAGGTCAGCGTCGTGGGGGCGAGGCCGAGGAGCGCGAGG 
;0 ATGCCGAGCACGACGCCGGCCGCGCCGTAGACGAGCTTGGCGCCCATGCC 
GCCGCCGAGCTCGGTGCGCGTGTCCCACTCGACGGGCGGCGCGGTGCGGC 
TCAACGCGCCGAAGCGCGAGGCGATCGCGCCGCCCTGCGCGATCAGCGCG 
5 GCGCCGAATACGATCGTGGCGATCTGAGGTGAGCTCTACTGGCATGATCC 
^5 CCGTCAGCCCGAGGATGGTGAGGACAATCGTCGCGGCGCCGCACAGTACC 
TCGCACGAGCGAGCCTCCGAGCACGACCTTCGGCGTCGTCTCGTCCTTTG 
GTCTGCGTCGCGCGCCCGAGTGCGGCGTTATGTGGCTCTCCGGCTGTGCA 
AACCGTTCACGTTCTTCCGGTCCTGGAGTCAGCATCGGCATGATTCCCCC 

20 

10 GTCCTGCGGTGAGGCCTTGTCGCGCTCACGCGCGCTCCGACTTGCACGTG 
CTGTGCCGGGTTCTCTCGCTCAGGAGGCGCCTCTCTTGGTGGTGCTTGCG 
TCCTGGTCCGTTTGCCCGCCTGTGCGGTAGGTTTCTTGAACCAGGTGACC 
25 TTCAGGGACCCCTTGATGCGCTCCATCGTGTCCTATGTCGATCCTTCTCT 
GACTTGTATGGGTCTCGAACCAACTACGCTTGATCAGGCCTTCGAAGGGT 

15 CCTTTGGGAGATCGACTCTGGATCCATACCGGGAGCCCCTGTTCTGCCGC 
TCTCTTAAGTTTCCCCTTCTGTATCCGTGTCGACCGGAAACGCTTTATCT 
CTAATGCGCTCTAATTGCGTCTCTGCCACACGTGCGCTTCACTCTGGATC 
TACTTCTTCTCCCTAGTCTTCTACCTCCGTACCCTTATTTGTTGGTTCTA 
TTTATTTCTTTTCGCTTCACCTCGCGTCATTGTCGCCTAGTGTTCCTCCC 

2 0 TCATATCGCCTTTGGTCTCCCTCGAGCGTACAGTCCTCTCTCTTCAGATG 

35 

CTTTCCGGCTCCTCTTCTGCTGGCCCCTTATCCTTTCTAATACTTC 

SEQ ID No 73 (>Contig4 8) 
40 ATGCGCCCAGGAACACCCCGGTGCGGCTGCCGTCGAGGGACTGGGGTGCG 

2 5 ATGCCGGCGTCCTCGAGCCCTTCCCAGGTGACCTCCAGCAGCAGGCGTTG 

CTGAGGATCGAGCGACCGCGCCTCCCGAGGCGAGGTGCCAAAGAACGCGG 
CGTCGAAGCCGTCCACCGCCTCGGTGAGCAGTCCGGCCCAGCGCGGCACC 
''^ TCCTCGCTGGGATGGACGCCGACCAGCGCCCAGCGCCGGTCGAGCGGCTG 
GACCGCGTCTCGGCCTGAGTCGAGCAGCTCCCAGAATGCCTCCGGAGTGT 

3 0 CCGCTCCGCCGGGGAAGCGGCAGCCAATGCCTACGATGGCGATCGGCTCG 
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GTCCGCTCTTGCTCCAAAGACGCGTTCTTTTTCGCAAGCTTGTCCATGAG 
CAGAAGGGCATGCTCAAGCTTCCCGGCATTCGTGGTCGCCATACTCCCTC 
GGTCCCTTACTCACCAACGATCTGCGCGAGCTGCGCCAGCTTTTCGGCGA 
GCAACGCGTCCTTCTGCTCGTCCGTCATGCCCCGCAGAGCCTCGAGATCT 
GCGGCATCGTTCTCGAAGCTCTTCTCCCGCTCGGTGGCCGGAGCGTGGGT 
CGCGCCGGCATTCGGAAACAGAATGTCTAGCAAGCTCCCGCTCAGAGCTG 
CTACGTTAGGGTAGGTCCATAGCAGGGTCGCCGGCACGGTGATGCCGAGC 
GCGGCCTCGATGCGGTTGCGGAGCTCCAGGCCTATCAGCGAGTCCATGCC 
GAGATTGCTGAACGGCACGTGCCGCTCGATCCTCTCCGGCGGAAGGCGCA 
GCCCCCGCCCCAACAGCTCGCTCAAGTGCTTCTCCAGAATCAACTGACGA 
TCTTCGGGCCTGGCGCTCTGCAGCGCCTCGCGCAGGTTCGACGCGTTCGA 
CGCGCCTCGGTCGGCGCGGTCACGCTCCTTCAGCAGCTCCGCCCACAGCG 
CCAATCGGGCCGCGTTGGGATAGAACTC 

SEQ ID No 74 (>Contig4 9) 

ACCACCGCTTCACTCAGTATGTACTTTGTTATACTCGTCTTAGTACAATG 
ATATAATACTCATGTGTATTCTTAATCTCGGGGAGANAAAATTGGAATAC 
TGGACACCGTTGCCGCATGCNGACTCTAGAGATCCCCCTGCGACGGTATC 
CCACGGCACCGGTATGGCCGGCGCGCGCTCCGGGGGTCAACGCCCCGTGG 
TTGCCTTCACGACAACGCCGGTCGGGCGGGGCGCCGTTCGATGCCGCGGG 
CCCGCGCGCGGCGGCGCGTTATCCTGTGGAGCATCTGGAGGGCGCTCACG 
CACCTGTCAGTCTAGTTCTGGCCCGCCCGGAAGGAGTCCGGGAGGCCGAA 
GTTGAACCCGATGTAGAGCGCGATGAACGACGGGAGCACGCGCGCGGGGA 
TGTGCAGCGCGGCGCCGATCGGCGTCGCGAACAGGACGAGCTCGCCCGGC 
ATGCCGGGCACGACATACCCGAGCAGAAACACGATCGGCACCACGAGCGT 
GAGCTCGAGCAGCGATATTTCATGACCGACCGCGCGGGCGGCGGCGCCCG 
CCATGACGAACACGCAGATCAACGTGCCGTTGACGTTGAGCCAACCGCCG 
AGTCCCACCACGAAGAGCCTGAGCTCCTGCGGCACCGCCGGATAACATTT 
GCGGACGAGGTGCAGGTTGAGCGGCGTCGCCAGCGCCTCGCTGCACGAGG 
CCCACAGCAGCGGATAGACCTTGAGCCAGTAGTTGACGAAATAG7CGCGC 
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AGCGAGAACTCCGGGGCGGCGGCCTTCATCCGCAGCAGGCTCGCCGCATG 
GAAGACGAGGCAGGCAGCGCCGACGACCCCGGACACGAGCAGATAGGAGA 
GCATGAGGTCTCCGGCGCCGAGCGAGGGCGCCCCCGCCGAGGCGCGCGCG 
AGGTCCCCCCCGTGCACCTGCGCGGCGAGCTGCGCGGGCAGCCCGCGGAG 
5 ATAGGCGCCGAGCCCGAACATGAAGAGCGGGACCAGGCACTGCACGGCGC 
^5 CTCCCGCGCGCTCCAGCGCGTCCGCGGCGCGCTCCAGCGCGCGGGCGACC 
CGCGGCGCGCGTACGGCCGCGAACGACGTCACGATGCCGGCGTAAAGGGC 
GAGGAAGCACGGGCTCGAGATGACCAGGCCCGAGGCGCTATAGAGGGTGC 
GCGCGGCCTCGAACGGCGCGCCCGTGCTGTGGCTGGGGAGCAGCGGCAGC 

20 

10 CCGAACACGAGCCATGTGACGACG.-.CCCCGAACAGGCACGCTGCCAGACG 
CTTGAGGGCGAGCCAGCCCATGATGrACGCGAGCAGCCGCCCCGGGCGCC 
CTTGCCGGTGCAGGCTCACGAAGG7CGGCACGAGGACGACGAAGATGACG 
25 ACCGGCGCCAGCGTGGTGTACCAATGCAGGAGACCGTCCATGGCGCGGGT 
CGACCACCGCGTGACGCTGGTCTC7CTGTCTGACTCGATCATGGCCCATT 
1 5 CGCCTAAAACTAATGATCCGTTCTC AAATTGGTCAAAAAAAAGTTCCCTT 
AAGACTGTTTTACTCCGGAATATTAATATATTTCTGAGTGTGAGGTGATG 
30 TTAATCACACATTCTGATATTCTCAAGGGGAATCCGTGTCATTGTGAATA 
CTTCTCTCTCTACAAGAGAGGTTATATATGGTCTCGAATATCTCGTCCGC 
TCTTATATATATTCTCTTGTGATAArATATATCGAGTGTGGGTACTCAGC 
2 0 TCTCTTGGTGTAATCTATAACTCGGCATCTCTCATAATACCTTATATATA 

35 

CACACTCTC7CGGTCATATCTCGCATAATAGATATATTTTATATGTTCCG 
CGTTTTATCCGAGTGGGATACACTTTTTCTATATTTTCTTTGGTGTGACG 
CGTGGCGTCGAGCCTTATTATTGA-TTGGTAGTCACGATATTCTCTAGAT 
^ GACATCATACAGATGCTCATAACTCGATAAACACAGGTCGTACACGACGA 
2 5 GACTCTCACTCTCACTCTT 



SEQ ID No 75 (>Contig5C 

TCCCCAGTTTCTCCTCTCTACGC:C"ACATCTCAGCAGGAAAAAANATAAT 
GGAGAATCGTTGCGCTCTAGCAGCArCTATAGGATCCCCGCTGCTCTTCT 
3 0 TCATGCACC7CGTGGAGCAGAAGT7CATCAACGCCTTCGCGATCATCGTG 
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GCGGTGAGCTTCCTGGCGTTGCTCCTGTCGCTCGTCGTCGCCGACGTCGC 

GACGCGGAACACGTTCCCGCCCGCGCCTTTGCCGGCGCTGAGCCCGCCGG 

CGCCGGCGCTGATCCCGCCGGTGCCGGGCGGATCCGTTGGGCCGTCGCCG 

GAGCCGCTGTCGGTCGGCCGGTGATCGGTTGTGCGGGCGCCGTGCCTCGG 

GCTTACTACCCCCTCTCGCGGGTGGGGATATGGCCGTGGATGAGGGAGGC 

GATQAAAATCGTGATCGCCACGTGCGCGTTGTTCTAGATCGTCCCAGGCT 

GACCGTCGGGAGCGCCCAGCACGAGATGAAGAGCCACACCGCGAGGACCG 

TGTT3AGGTACCGCACCGCAGGGGCGAGCATGGCGGTGATCGCGAAGATC 

ATGCAGAGCAGCCCGAGCACCCATGTGTTCGTCCGCTGCGCGTGGCTGTG 

CGGCCAGATGACGGCCGAGATGAGGAGCCAGAACCCGAGGACGACGTTCA 

CGAT3CGCGCCATGAGATTGCCAGCTCGAACCATGCTCCCTCCCACCTCC 

GATCATGGGACCGATCGGGTCGCCACC-GATCGATAACGGGCGTCAGGAGA 

CCGTCAATCGGCGAGCTCGTGAGCCATGGCGACAGCCCGCCGACCGCGCC 

GGCGGGTCTCTGGCCTGCTGGTCGCCGTGCCGGCGGCGGCGATGGGCCTG 

CCTCCGGTCGGGCCGCGCGGGCGCGCGGCGCTCCGCGCGAAGCTGGAGAC 

GCGGGACATCGCTCCCTCGCCCCGCGCTCGGACGAGCGGCGCGAGCCACT 

TCTCGACGGCCGAGCGGGCACTAAGCTTCCGTCATGAGGCTCGGCGCACG 

GCTCACCACGCACACGTTCTCGGCCGGCGCCGCCGGCATCAGCTTCGTCG 

TCCAGCCGATCCCGGGCTCGGACCAGCTGTTCGTCATTCCGATCCAGTAC 

CTGCTCGCGGCGTCGCTCGCGAAGGAGCGAGGCGCGCCGCTCTCGAAGGC 

GGCGTGGTCCCAGGTCCACCAGCTCATCTGGGGCGGCGGCGCGCTTCGCC 

TCATGCTCGGCTTGACCCTAGGGCTGATCCCGCTGGCCGGCGCGTTCACG 

AACGCGATGACGGCGTTCCTCACGACCGAATATCTCGGGTACTACGTGGA 

TAGAGCCCTCGACAACCCGGACAATCCGCCTCCGGCCCTGTCGATCCAGG 

ATGTCTTGGACGCCATCACCTCCTTCTTCACCGGGCGAGCGCGGTAGGCG 

AGCGGTCCCTGGGTCGAGCCCACCCTGCGGCTCTAGGAGCCGAAGGGCGA 

GCTCCTCGGGAGCGGCGCGGCGTCACCACCAGATTCGCCGGCGCTTGCGG 

CCGC-AGCGTATCGCGACCGCCGCCACCGCCGCCACGGCGAGCACGGTGAC 

CGC3GCGGCCGCCGCGATGGCGACGCTCCGGGCCGTGTGCTCGGCCTCGC 

CC7TGATGCCGCCCCACCTCGCCTTGACCGTGACCAGGTCGCGCGCGAGC 
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CGCTCCTTGCTGTGCTCGATCTGCTCGGCAGGCCCCGCGGGCGCCACGAA 

ACCTGCGCCGGCGCTGACGTGACCTCGCTCGCTGAGGGGACTGGCGACCC 

TCTCCGTGTCGAACCGGATTGCGCTGGATGGATCCATATGTCCCGCGCTG 

CAATCGTTGCTCCCCGCCGGCACATCGGAGTGCTCGCCGGATCGCGCGGC 

AGCGCCGACGCCGTACTTCCATAGGATAGCCCACCCCATCGGACAAGCCG 

GCTCCTGACGGCGGGCACCGAATGTTCGCCAGACGGGCACAAGGCGCACG 

CCGCGGACGGATCGGCCGCACTGGCACTCCAGAGCGCATCGACGGATGGC 

CGACGGATGTGCAATGAGGCGCCCGCACGAAGCGAATTGTCCCGAATACA 

GCGAAGAAATCTATAGCGATGCGAGCAGAAGGATATGTCTATGGGGGGCA 

GTCAGAAACTGGGGACAGTCAACGCACATATTCTCTCCAANTGCTAACGA 

CAGCGTGCGCAGAGGAAGTATCCTACTAGTGTAAGAGGGACATTCGATGC 

GACCGCATAAACATTCAGTCTACAACGCGTGAGAGGATGGAACACCCCGC 

CCCTCTGAAGGCTAGACAACCATGAATATGTGCAGAGGAAACACAGAATT 

CCAAAGGTGAGAACATATGTAGGATCGCGCCACCCGAGATTGAGTGAAGA 

TATACATATATACTTATATGGATCTACAACATGGCGAACCGAACGTAGCA 

NAATAGTAGATATAATTGTAATACTGAGCTACCGACAGAAAGATACACAC 

GAGTGTACACACATCACACGCAGAGTGGTACCAAATTCACACCATGCGAG 

CCACAATGTGACACGGAGGAGCACAGCATGGGCGCCACTATGGAGGAGAA 

ACTACTGCAACCCACATCTGATGGACTGACCGCACGGACGGGACGTGTCT 

ATACATACAGATACATCNGATGGAGGAAGATGCATGTGCGATGATATCAT 

CGTCGCAAACTCATATGTCGAAGAAGATATGNGTCAACTCAGCACTACTC 

ACACGATACGTGAACAGGAGTGACTAGGACATCNCATGGTGTGTCGGCGC 

GTGCACGTGATATCAAACTCTCTGATCAACCACACACTATATAAGGAGTA 

TCGAGCGGCGATGGAACACCCCCTCACAGCATACGTATATGCACAACGTC 

TGAACACTCTNGAGACACAGTGGAAGG 

SEQ ID No 76 (>Contig51) 

GATCCAGTTACGCCCCGCCGCCTCGGTCACGCCGGGGTTTTCGGCGTCGA 
CC3:-GGACGGTCGGGGCGACAACCGGGGGTTGTCGTCAGCGGTTCGCGTG 
GArC-TGCGCGACGAAGTTGTCCGCCGCCGAGTCGTCCTGCGTGCCGCCGC 
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GCTGCCGTACGAGCAGCCCGAGGCACCGCGCGAGCTCCTGCCGCTTCCGA 
TGCGCGGCGGCCTGCTCGGCCTCGTGCTCCGCGATTCGGCGCTGGCGCCT 
GCTCTCCTCGGCGCGTTCCTGCGTCCGCGCGAACAGCACCTCCTGCACCT 
CCGCGAGTTCCTCGGCGGTAGCGACCCGCACGTGATCCGCACGTCCTCCG 
5 AGACCGGTGACGACGTCAACCGCGGAGTAGTACGCCTCCGCGCGGACGAC 
15 GTCGTCGAGGTCGAGTTCAGCGTCGGCTCCGACGACCAGGCGCATCGCGT 
CGGAGGTGAGGACGACGCCACCGTCGAGCGGCGCGGCCTCCGGCTCGTAG 
ACGTACTGGAGGTCGGTCTCCTCGTCGTCGTCACCGTCGAGGTCGACGTG 
CCGCCGCAGGCCTCGGGTCCACTCGATCGCGCGTCGTCCGGCGAGGGCTT 

20 

10 CCTCGTACTGGGCCCACCACGCGCGCAGCTGCTTCGGCGTGCCGTAGCCC 
TCGGCCATGTCGGGGTCGAGCCCGGCGACCTCGATGTCCCACAGTCGGTA 
GAGGATCTGGAACGGCGTCATGGACTTCCGGCCCCGGCCGGTCTTGGAGT 
CCAGACGGGCGGTCTCCATCGCAGCTGCGCCGGCGGCTTCGAGGTCCTGG 
TCGACGGAGTCGGGCCGCTCTCGCTTCCCGTCCTGGTTCTTGGTGAGGTA 
1 5 CTCGATCAGCGCGACGTCGTCAGCTGACCGGACGATCGAGACCATCACGC 
CGTGGCCCTTGCCCTTGCACTTGCAGCCGGGGGTGTCGCAGTCGGTCGAG 
ZO GGCTCGAACTTGGGGTCAGCCCGCTTGAGGGCGCCCGCCCACATCTCCCG 
GAGCCAGTCCTCCCAGTCCCCCAGGTCCGTCTCGGAGGGCTCAAAGTGTC 
CGACGACGTCACCCTTGGCCGGGGTGCCAGAGAGCTCGCCGCCGAGGAAG 
2 0 ACCAGCAGGTTGAGGTGGGGGTGGTAACCGTtCTTCTTGGACCGGGTGAC 
CTCAGCCGCGCGGACCATGCCGATG7AGCCGATCCGGTGGCGGATGCCGT 
. CCTCAGCGGGACGGACGTACTGCGTTCCGTCCTTCCGGGTGCGGCGGGCC 
TCAGGGCGGCCGTAGAAGGCCGGGGCCGTGAGCATCCGCTGGTAGGCACC 
GGGCGCGCGGCGGGGCTTGCCCGACCGGTCGAGGACCGGGGCGCCCTTGT 
2 5 CGTCCAGGAGAGGCCCGCCCCAGAGCGCGGCGACCAGGCTGTCGAGGTCG 
GTGGTCTGGTTATGCCGGGCGGTGAGGACGACAACGGCGAGCGTGCCGCC 
GGCGGCGAGGTGCCGCAGAGCACCGGTCTTGATCTCCTCGGTCCGGCCAC 
GGCGGATCGCGGAGGAGCACTCCGGGCATAACCAGATCCGCCCGCAGCGG 
ACCAGGCCGATCGTGACGACGTACCCACGGCTCGACTTCGCGTAGATCAC 
30 GCCGGTGTCCGGGTCGAGGACCCGCCGCCCGCACCCGCCGCAGGCGTCGA 
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TCCCGGAGACCCGGTTGAGCACCTrGCGGCCCTGGTAGCGGCGTACGGCA 
GCGGTCGCCGCGCGCCTTGTGGTCTGTTCCGAAAGGGCTGCCGCCCTCTC 
GGACTCTCCCGTTCCTCCCACGACTGCCACTTCCGCAAAGTCGCTGGTCA 
GTGGGGGGTGGGAAAACTCTGTCAACCCTTTACCTAGGCGTCCCTTTTTG 
CCAGGGGCGGTCTCACGGGCGGCCTCGGCGGCTCGGTCGGCGGCCTTCCG 
GGCCCGCGCGGCCCGCTTCTTGCACGCCTCGGAGCAGAACTCCTTGGCTC 
GCTTGCCGGGGGTGATCGTGAGGGCGGCGCCGCAGCGGCAACGGGGGCCG 
GCGGGGACGCGGGCGGGCGACTGAGTCGGCGCGCCGATCAAAGAGGGGGT 
TGCGGACGCCAAAGCGTCCCTTACGCTGGACACAGACGAGTACCTTGGTT 
GGTAGCCGGGTGGACGTCAGAAGCGGTCAGGGATTAGGACCCCTGGCCGT 
TTCGCTTTTTCTGGAGTTGTTCGGGTAGATCCTGCCGCATCGCCCGCCTC 
ACGCGTGGCTCGCCGCGCGGATGCCTCAGAGGCCCCACCGGTCGTCAGGA 
CGCAGACGTCGGCGTGCTCCTGGTGGTGAGTCACCAGCTCGACCACACGG 
GCGCGGCCGGCGACGTCTC 

SEQ ID No 77 (>Contig52) 

CGGGATCTGGCCTTCATTAACCAACGACGGGGCAAACATAATAGGCTGGG 
CATTGCGCTTCAGCTCACCACAGCCCGTTTTCTGGGAACATTTCTGACGG 
ATTTAACTCAGGTTCTGCCTGGTGTTCAACATTTTGTCGCGGTACAGCTt 
AATATCCACCGTCCAGAAGTTCTCTCCCGCTATGCTGAACGGGACACTAC 
CCTTAGAGAACATACTGCATTAATTAAGGAATATTACGGCTATCATGAAT 
TTGGTGATTTTCCATGGTCTTTCCGCCTGAAGCGTCTGCTATATACCCGG 
GCGTGGCTCAGTAATGAGCGACCGGGTCTGATGTTTGATTTTGCCACTGC 
ATGGTTGCTTCAAAATAAGGTATTACTGCCCGGAGCAACCACACTAGTAC 
GTCTCATCAGTGAAATTCGTGAAAGGGCAAATCAGCGGCTGTGGAAAAAG 
CTGGCCGCACTGCCGAACAAATGGCAGGCAGCTCAAGTGATGGAGCTTCT 
GGTCATTCCGGAAGGTCAGCGTGTATCAGCACTGGAACAGTXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGCTGGAACGATAT 
ATCCGATTACGAAGTCTTGAGTTTTCCCGACTGAACTTTTCCGGTCTGCC 
TGCCATTCAACTGCGTAATCTGGC7CGTTATGCTGGCATGGCGTCGGTAA 
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AATATATCGCTCGAATGCCACAGCAGAGAAAGCTTGCTGTACTTACTGCA 
TTCGTTAAAGCACAGGAAATAACGGCATTAGACGATGCCGTTGATGTGCT 
TGATATGCTAATTCTGGACATTATCCGCGAAGCAAAGAAAACCGGGCAAA 
AAAAAAGACTCAGGACACTGAAAGATCTTGATCAGGCCGCATTGTTACTG 
5 GCGCGGGCATGTGCATTGTTGCTGGATGATAATACAGATGTCCCAGATCT 
^5 CAGGCAGGTTATCTTCAAGTGCGTACCCAAAAACAGACTGGCAGAATCTG 
TAAGCAAGGTTAATGAACTTGCTCGTCCACAGAACAXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXAAACGTTTTCTTCCGGCGGTGTT 
GCGGGACCTGCATTTCCGTGCGGCACCGGCAGGTGAACATGTACTGGCTG 

20 

: 0 CGATTCATTATCTGGCAGAACTGAATGGTTCGAAAAAGCGCATCCTTGAT 
GATGCGCCTGAACATATTATCACCGGTCCCTGGAAACGCCTCGTATACGA 
TGCGGAGGGACGGATACAGCGTGCAGGTTATTCACTATGTTTGCTGGAAC 

25 GCCTTCAGGATGCACTGCGCCGCCGGGACATCTGGCTTGAAAACAGTGAT 
CGC7GGGGAGATCCTCGCGAGAAGTTGTTGCAAGGTGAAGAGTGGCAGAC 
1 5 TCAGCGTATTCCTGTCTGTCGGGCACTGGGACATCCTGTCGATGGACGTA 
AAGGTGTGCAACAACTGGCTATTCAGCTGGATGAGACCTGGAAAGCCGTG 

3^ GCATCACGATTTGAAAAGAATGCGGAAGTTCATATCTGTAATGAAGGTAA 
ATATCCATCCCTGACTATCAGTTGTCTGGAGAAACAGGAAGAGCCACCAT 
CATTGCTTCGTCTAAATAATCGGATCAAACAGCTACTCCCACCGGTAGAT 

2 0 TTAACGGAACTGTTACTTGAGATAGATGCCCAGACAGGATTTACACATGA 

35 

GTTTGCGCATGTCAGAGAATCTGGTGCTCGAGCGCAAGATTTGCACATCA 
GT7TATGTGCGGTATGAATGGCTAAGCCCTGTAATATGGGCCTGAACCCG 
TTGATAAAGCACAATATACCAGCATTGACCCGCCATCGGCTCAGTTGGGT 
^ GAAACAGAATTACCTTCGTGCAGAAACGCTGGT 
25 

SEQ ID No 78 (>Contig53) 

ATTCCACGCGCTCACGGTCAGCTTCGACCCGCGCGAGCGCCCGGCGGCCG 
CCTCGCAGAAGCGCGCGGTCACGCrGTCCGAGCTCGGCGCGGACGCGCAG 
GCGCCGGAGTGGCCGTTCCTCGTCGGCGACGAGGCGGCGACCCGCGCGCT 

3 0 CGCCGAGGACCTCGGGTTCCGCTACGCCTACGATCCGACCACCGATCAGT 
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ACGCCCACCCGGCGGCCGTCTTCGTGCTGACGCCGGACGGGCGGATCTCC 

CGGTACCTGTACGGGACGGAGTTCCCGGCGCGCGATCTCCGGCTCGCGCT 

CCTGGAGGCGAGCCGCGGCGGTATCGGCACGATCGTCGATCGGGTGATCA 

TGACCTGCTATCGCTTCGACCCGGCGAGCCGGAGATACGCTCCGTTCCTA 

CTCGGCTTCCTCCGGCTCGGGGCGGCGGCCATCCTGATCACGGTCGGCGG 

GCTGCTCGCCGTCCTGTGGCGGCGCGAGCGCCGGCGGCCAGGTGCTCGCA 

CGAGCGCCGCCGTCGGTCGTGACGCCGTGGCCGACCGCCAGGGGAGGTCA 

CCATGATCAACGAGCTCCTGCGCAAGCTTCTTTTTCTGTCCGGCCAGTGG 

TCGACGATCGTGTTCGACATTTACAAGCTGCTTTACTTCGTGATCTCGGT 

GACGATGGCCGGCGCGACGCTCGTCGCCCTGTTCGCGGCCTACCTGATGA 

TCCGGTACCGCAGGCGCCAGCGGGATGTTGAAGGCCCGTTCCCCGGAGCG 

ACCGCGAGGCCTCCGCTCCTCCTCGAGGTCGGCATGGTGCTGGGCCTCAT 

CGTCCTGTTCCTCGTCTGGTGGGTCATTGGAATGCGGCAGTATGCAGAGC 

TCCGCGTCGCCCCCGCGGACCCGGTCGTGGTGTACGTGACCGGGAAGCAG 

TGGATGTGGAAGTTCGCCTACCCGGAGGGCCCGAGCTCGGTGGCGACGCT 

CTATGTGCCGGCGCGTCGGCCGGTGAAGCTCGTCATGACGTCCCGGGACG 

TGATCCACAGCTTCTTCGTCCCCGATTTTCGCATCAAGTACGATGTCGTC 

CCCGGCCGCTACACCACGCTGTGGTTCGAGGCGACCGCGCCGGGCGCCTA 

TCAGATCCTGTGCACCGAGTACTGCGGGACGAACCACTCCACCATGCGCG 

GCGAGGTGATCGCGCTCGAGCCCTCCGATTTCGCGCGGTGGCTCTCCGAC 

CGCGGGCGGGGCGCCGGTATCGCCGGACAGGAGTACACGCCGCCGTCGAC 

GCCGGGCGAGGGGATCCCGCGCGAGCCGCTCAGCCTCGTCCGGCTGGGCG 

AGAACATCGCGGCCGAGGAGGGCTGCCTGCGCTGCCACACGCCGGACGGG 

ACACCGCACATCGGGCCGACCTGGGCCGGCCTCTACATGTCGGTCGTCCC 

GCTGGAGAGCGGCGGCGCCGCGGTCGCCGACGACGCGTACATCACCGAGT 

CGATGATGGATCCGCTCGCCCGGATCCACCGCGGCTACCAGCGGGTCATG 

CCCTCGTTCCTCGGCCGGCTCCAGCCGGCGCAGGTCGCCGCCATCGTCGA 

GTACATCCGGTCGTTGAGGGGCGTCGCGCCGGAGCCGGGCGCGCGGACGC 

CGCTGCCCGAGGGCCCGCCCTTCCTGCGCTCCGGCCCGGAGCGCCCCGCC 

CCGCTCAGCGGGGGCGCGCCGGTCGGCCCGATCGAGGGCGGCAAGCCCGG 
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GGAGGAGCTCCGATGAGCACGGAA3CGTACGAATCTCTGCCCGACGCGCC 
^CCGA^GGCCCGAGCCCCGACTACCTCCATGTTTACCGCGGGGTGACG 
GAGTGGCTCACGACCACGGATCACAAGCGGATAGGTCTCATGTTCTACGC 
CGTCATCGTCGGGAAAGCTTCTTCCTCGGAGGCATATTCGCCCTCATCAT 
5 GCGGACCGAGCTCCTCACGCCCGA3CGGACCATCATCGACGCGGCGACCT 
15 ACAACCGGATGTTCACGCTGCACG3CGTGATCATGGTCTGGCTGTTCATG 
ATCCCGTCGATCCCCAACGCGTTCC-GCAACTTCGTCCTGCCGATCATGCT 
CGGCGCCAAGGACCTCGCGTTCCCCCGGATCAACCTCGCGAGCITCTACA 
TCTACCTCCTCGGGGCGGCGATCGCGATGGGCGGCATGATCGCGGGCGGC 

20 

1 0 ACGGACACCGGCTGGACGTTCTAC CCGACGTACAGCCTGAAGACGCCGAT 
GACGCTGTTCCCGGTCGTCTTCGGtATCTTCATCGTCGGCGTCTCGTCCA 
TCATGACGGCGGTCAACTTCATCG73ACCACGCACACGATGCGCGCCGAG 
25 GGGCTCACGTGGAGCCGCCTGCCGCTCTTCGTCTGGAGCACCTACGCGAC 
GAGCATCATCCTGCTCTTCGCGACCCCGGTCCTCGGGCTCTCGATCCTGC 
15 TCATCGGCATCGACCACGTGACCGCGCTCGGGATGTTCGATCCCCGGTTC 
GGCGGCGATCCGGTCCTCTTCCAGCACCTCTTCTGGTTCTACTCCCACCC 
^0 CGCCGTCTACATCATGATCCTGCCGGCGTTCGGCGTGGTGAGCGAGGTCG 
TCTGCACGTTCGCGCACAAGCGCCCCGCGTCCTACTGGGCGATCGCCATC 
TCGTCGCTCGGGATCGCGTTCGTGGGGTTCTGGACGTGGGGCCACCACAT 
20 GTTCGTGGCGGGGATGAGCGAGTACGCCGCGGACGTCTTCGGCGTGCTCT 

35 

CGATGTTCGTGGCCATCTTCTCGGCCATCAAGGTCTACACGTGGGTCGCG 
ACGCTGTACAGGGGCTCGATCCACrTCAACACGCCGCTGCTCTACTTCAT 
CGCCTTCCTCTTCCTGTTCGTCTTCGGGGGGATGACGGGCGTGGCCGTCG 
^ CCACGCAGTCGCTGGACGTGCACT GGCACGACACATACTTCGTTGTGGCG 

2 5 CACTTCCACTTCATCATGGTGGGCGGGACGCTCACCATGTTCCTCGCGGC 
GGCGCACTACTGGTTTCCGAAGATGTTCGGGCGCCTCTACTCGGAGCGCG 
TCGGGCTCCTCTCGGCCGCGTCGGrGTTCCTCGGCTTCTTCTTGACCTTC 
TTCCCGCAGTTCCTCCTCGGGAACATGGGGATGCCCCGCCGCTATTACAG 
CTACCCGCCGCGCTACCAGTGGCTCCACGTGCTCTCGACCGGCGGCGCCT 

30 ACCTGCTCGCCGCGGCGCTCGTGArCTCGCTCCTGAACCTCGTCATCGCG 
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CTCAAGTGGGGCCGGAAGGCCGGGAGGAACCCCTGGGGCGGGCGCACGCT 

CGAGTGGATGACCGGCGAGCCCTTGCCGCCCAAGCACAACTTCCCGGTCG 

CGCCGCTCGTCCGCCGCGGCCCGTACGAGTTCCAGCTCTCCGAGGAGGAC 

GCCCGTGCGACAACCACGCCCGCTGCGTGAGCAGTTCGAAGATCTCGAGA 

AGCAGACGCACGCGGCCCGCCTCGGGATGTGGTTGTTCCTCGGGAGCGAG 

GTGCTCCTCTTCACCGGGCTCTTCGCGCTGTACGCGGCGTACCGCGAGCT 

CTACCCGCGCGATTTCGCCGAGGCCATCGCGCACAACAACGTCGCGATCG 

GCACCACCATGACGCTCATCCTGATCGGCAGCAGCTTCACCGTCGCCATG 

GCGGTGCACGCCGTCCGCGCCTCCCACCCGCGGCGCGCCGCGCTGTTCCT 

CGCGGTGAGCGTGGCGATCGGGATCGTGTTCCTCGTGCTGAAAGGGATCG 

AGTACGCGCAGCACTTCCGCGAGG3CATCTTCCCGGCCGGCGCCTACCGC 

TTCGCGGAGCTCCCGACGTTCGGCGCGCAGATGGCGTTCACGCTGTACTT 

CGCCATGACGGCGCTCCACGCCCTGCACGTCGTGGGCGGGGCCGGCCTCC 

TCACGGGGGTCGCGTGGGGGTGCTGGAAGGGCCGGTACTGGGCATACGAC 

CAGACGCCGGTGGAGCTCAGCGGCCTCTACTGGCACCTCGTGGACATCAT 

GTGGATCTTCATCTGGCCGCTCCTCTACCTGACGCGCAAGTGACGGCACA 

GCCCGGAGACGACCATGCCGCAAGAGCACGTCGCGGAAAGCACGCCCTGG 

ACCCGTTACCTCATGGACGCTGATGGCCCTCATTGCCCTCACGCTCCTGT 

CGTTAGCGCTCTCGTTCTTGCGCACGGGGGCTTGGGAAATACCGATCGCG 

CTGCTCCTCCCCGTGGTGAAGAGCGTGCTCGGGCTTGCTTTCTTCTTGCT 

CCGTCGAGGTGCATTAAGGTCATCAAACGCCTTTTTCAAACATTGGCTGC 

TGGTGTACTTACTTGGACGTTTCATACTGTCTCTTTTATGGCCCCCAACG 

GTCATTACGTCGACCTACACAATTTCTCTTCCCGCACCCTCCTACCTGTA 

TCTCTAAGCACTGCCTTGCGTCCTGCTCTATTACATTCTACTCCGGCTGT 

CCATGTGTGGGATTATATGCGCGAGGTACCTATTCCGCCGTGGAGTCTCC 

ATTTACCTCTTGGACCTTGCCCGrrCTGAATCTCGATCTCCTCATGCGTT 

GGTCCACCATGTATTACCTCCTAGAATCTTATACTCCATATCTCTATATA 

TCTAGTTGTGCGTGTAATTGTGTCATATATTATCGCCACTGCTGTATGAA 

TACCGTGCCGACGTGCTATATACGAAANTACTCCTCGGTCGATATCTCCA 

CCTCATATATACCTCCGAGTGTAGTATACGCACGAGTGTATATACTCTTC 
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CTCTGGTCACGCGACTTCGTGCTGATATGATACCATCGTTCCATGTTACG 
CGAAGTTACTCATAAGATCTCCTCACACATCAACGAGTGTACTCCTATGT 
GTTTCATACAAACTCGATACCCTTCAGAGTAGTGTCATGCCTATGTGGTA 
TGCATAATGTTAGTATACTTT 

S3Q ID No 7 9 {>Contig54) 

TGGGAAAGAGGGCCACAGGGGATGTAGCAGGACGCTTAATAGTAAATGAC 
GAGGGTGTGCCGACGAGACCCGTAGGAAACAACGGGCACAGACGAGAGCA 
ATAAAGGGGGTTGGAAGGTACCCCGGATAGAGTAGAGAAGGCTAGCGGAC 
GAGTAAGACGCGGAGGAAATAAGTCGGCGTCGTAGAAGTTCTGTGGAGAA 
GGTACGACTCTTAAAGACCTAGGCGGGAGACAGTTTCCACCCGAGGCAGA 
GCAAGACCACAAGATTCAGAGGGAGTAAGGAGTTCCGAATTGGAGAGGTT 
GAGGGGCGTGTGAGCCGTCAAGTGGGGCGCGTACGCAAAGAAAGAAGCGT 
CCATGTCAGAGGCCCAGCGGCCGTTGCGGCCCTACATGGGTGATCACGGT 
GGGCCGTAGGCGGACCGGAGATGAGCGCGGCTCCGCCCACCGGACGGCGA 
GGGGCACGGCGCCTTCGTCCGGGGCGCTCGCGCGCGATCTCGCGCGCGCG 
GGGGGCGCCGGCGTCGCCGTCGTCGGCGACGGCCAGCCGCCCATCGTCCA 
CGCCCTCGGGCACGTCATCAACGCCGCGCTCCGCAGCCGGGCGGCCTGGA 
TGGTCGATCCTGTGCTGATCGACGCGGGCCCCTCCACGCAGAGCTTCTCC 
GAGCTCGTCGGCGAGCTCGGGCGCGGCGCGGTCGACACCTTGATCCTCCT 
CGACGTGAACCCCGTGTACGCCGCGCCGGCCGACGTCGATTTCGCGGGCC 
TCCTCGCGCGCGTGCCCACGAGCTTGAAGGCCGGGCTCTACGACGACGAG 
ACCGCCCGCGCTTGCACGTGGTTCGTGCCGACCCGGCATTACCTCGAGTC 
GTGGGGGGACGCGCGGGCGTACGACGGGACGGTCTCGTTCGTGCAACCCC 
TCGTCCGGCCGCTGTTCGACGGCCGGGCGGTGCCCGAGCTGCTCGCCGTC 
TTCGCGGGGGACGAGCGCCCGGATCCCCGGCTGCTGCTGCGCGAGCACTG 
GCGCGGCGAGCGCGGAGGGGCGGATTTCGAGGCCTTCTGGGGCGAGGCAT 
TGAAGCGCGGCTTCCTCCCTGACAGCGCCCGGCCGAGGCAGACACCGGAG 
CTCGCGCCGGCCGATCTCGCTCAGGAGCTCGCGCGGCTCGCCGCCGCGCC 
GCGGCCGGCCGGCGGCGCGCTCGACGTGGCGTTCCTCAGGTCGCCGTCGC 
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TCCACGACGGCAGGTTCGCCAACAACCCCTGGCTGCAAGAGCTCCCGCGG 

CCGATCACCAGGCTCACCTGGGGCAACGCCGCCATGATGAGCGCGGCGAC 

CGCGGCGCGGCTCGGCCTCGAGCGCGGCGATGTCGTCGAGCTCGCGCTGC 

GCGGCCGCACGATCGAGATCCCGGCCGTCGTCGTCCGCGGGCACGCCGAC 

GACGTGATCAGCGTCGACCTCGGCTATGGGCGCGACGCCGGCGAGGAGGT 

CGCGCGCGGGGTGGGCGTGTCGGCGTATCGGATCCGCCCGTCCGACGCGC 

GGTGGTTCGCGGGGGGCCTCTCCGTGAGGAAGACCGGCGCCACGGCCGCG 

CTCGCGCAGGCCCAGCTCGAGCTCTCCCAGCACGACCGTCCCATCGCGCT 

CAGGAGGACGCTGCCGCAGTACCGTGAACAGCCCGGTTTCGCGGAGGAGC 

ACAAGGGGCCGGTCCGCTCGATCCTGCCGGAGGTCCAGCACACCGGCGCG 

CAATGGGCGATGTCCATCGACATGTCGATCTGCACCGGGTGCTCCTCGTG 

CGTCGTGGCCTGTCAGGCCGAGAACAACGTCCTCGTCGTCGGCAAGGAGG 

AGGTGATGCACGGCCGCGAGATGCAGTGGTTGCGGATCGATCAGTACTTC 

GAGGGGGGAGGCGACGAGGTGAGCGTCGTCAACCAGCCGATGCTCTGCCA 

GCACTGCGAGAAGGCGCCGTGCGAGTACGTCTGTCCGGTGAACGCGACGG 

TCCACAGCCCCGACGGCCTCAACGAGATGATCTACAACCGATGCATCGGG 

ACGCGCTTTTGCTCCAACAACTGCCCGTACAAGATCCGGCGGTTCAATTT 

CTTCGACTACAATGCCCACGTCCCGTACAACGCCGGCCTCCGCAAGCTCC 

AGCGCAACCCGGACGTGACCGTCCGCGCGCGCGGCGTCATGGAGAAATGC 

ACGTACTGCGTGCAGCGGATCCGAGAGGCGGACATCCGCGCGCAGATCGA 

GCGGCGGCCGCTCCGGCCGGGCGAGGTGGTCACCGCCTGCCAGCAGGCCT 

GTCCGACCGGCGCGATCCAGTTCGGGTCGCTGGATCACGCGGATACCAAG 

ATGGTCGCGTGGCGCAGGGAGCCGCGCGCGTACGCCGTGCTCCACGACCT 

CGGCACCCGGCCGCGGACGGAGTACCTCGCCAAGATCGAGAACCCGAACC 

CCGAGATTGAATGAGCCATGGCGGGCCCGCTCATCCTGGACGCACCGACC 

GACGATCAGCTGTCGAAGCAGCTCCTCGAGCCGGTATGGAAGCCGCGCTC 

CCGGCTCGGCTGGATGCTCGCGTTCGGGCTCGCGCTCGGCGGCACGGGCC 

TGCTCTTCCTCGCGATCACCTACACCGTCCTCACCGGGATCGGCGTGTGG 

GGCAACAACATCCCGGTCGCCTGGGCCTTCGCGATCACCAACTTCGTCTG 

GTGGATCGGGATCGGCCACGCCGGGACGTTCATCTCCGCGATCCTCCTCC 
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TGCTCGAGCAGAAGTGGCGGACGA^rATCAACCGCTTCGCCGAGGCGATG 
ACGCTCTTCGCGGTCGTCCAGGCC33CCTCTTTCCGGTCCTCCACCTCGG 
CCGCCCCTGGTTCGCCTACTGGATCrrcCCGTACCCCGCGACGATGCAGG 
TGTGGCCGCAGTTCCGGAGCGCGCTGCCGTGGGACGCCGCCGCGATCGCG 
5 ACCTACTTCACGGTGTCGCTCCTGTTCTGGTACATGGGCCTCGTCCCGGA 
^5 TCTGGCGGCGCTGCGCGATCACGCCCCGGGCCGCGTCCGGCGGGTGATCT 
ACGGGCTCATGTCGTTCGGCTGGCACGGCGCCGCCGACCACTTCCGGCAT 
TACCGGGTGCTGTACGGGCTGCTCGCGGGGCTCGCGACGCCCCTCGTCGT 
CTCGGTGCACTCGATCGTGAGCAGCGATTTCGCGATCGCCCTGGTCCCCG 

20 

10 GCTGGCACTCGACGCTCTTTCCGCCGTTCTTCGTCGCGGGCGCGATCTTC 
TCCGGGTTCGCGATGGTGCTCACGCTGCTCATCCCGGTGCGGCGGATCTA 
CGGGCTCCATAACGTCGTGACCGCGCGCCACCTCGACGATCTCGCGAAGA 
25 -GACGCTCGTGACCGGCTGGATCGrCATCCTCTCGTACATCATCGAGAAC 
TTCCTCGCCTGGTACAGCGGCTCGGCGTACGAAATGCATCAGTTTTTTCA 
15 GACACGCCTGCGCGGCCCGAACAACGCCGCCTACTGGGCCCAGCACGTCT 
GCAACGTGCTCGTCATCCAGCTCCTCTGGAGCGAGCGGATCCGGACGAGC 
30 CCCGTCGCGCTCTGGCTCATCTCCATCCTCGTCAACGTCGGGATGTGGAG 
CGAGCGGTTCACGCTCATCGTGATGTCGCTCGAGGAAGAGTTCCTCCCGT 
CCAAGTGGCACGGCTACAGCCCGACGTGGGTGGACTGGAGCCTCTTCATC 
2 0 GGGTCAGGCGGCTTCTTCATGCTCCTGTTCCTGAGCTTTTTGCGCGTCTT 

35 

TCCGTTCATCCCCGTCGCGGAGGTCAAGGAGCTCAACCATGAAGAGCTGG 
AGAAGGCTCGGGGCAAGGGGGGGC3CTGATGGAGACCGGAACGCTCGGCG 
AGTTCGACGACCCGGAGGCGATGC7CCATGCGATCCGAGAGCTCAGGCGG 
^ CGCGGCTACCGCCGGGTGGAAGCG7TCACGCCCTATCCGGTGAAGGGGCT 

2 5 CGACGAGGCGCTCGACCTCCCGCG C TCGAACCTCAACCGGATGGTGCTGC 

CCTTCGCGATCCTGGGGGTCGTGGGCGGCTACTTCGTCCAGTGGTTCTGC 
AACGCTTTCCACTATCCGCTGAAC3TGGGCGGGCGCCCGCTGAACTCGGC 
GCCGGCGTTCATCCCGATCACGTTCGAGATGGGGGTGCTCTCCACCTCGA 
TCTTCGGCGTGCTCATCGGCTTTTACCTGACGAGGCTGCCGAGGCTCTAC 

3 0 CTCCCGCTCTTCGACGCCCCGGGC7TCGAGCGCGTCACGCTGGATCGGGT 
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CCTGGTCGGGCTCGACGACACGGAACCTTCCTTCTCGAGCGCCCAGGCGG 
10 AGCGCGATCTCCTCGCGCTCGGCC-CCAGGCGCGTCGTCGTGGCGAGGAGG 
CGCGAGGAGCCATGAGGGCCGGCGCCCCGGCTCGCCCCCTCGGGCGCGCG 
CTCGCGCCGTTCGCCCTCGTCCTGCTCGCCGGGTGCCGCGAGAAGGTGCT 
5 GCCCGAGCCGGACTTCGAGCGGATGATCCGCCAGGAGAAATACGGGCTCT 
15 GGGAGCCGTGCGAGCACTTCGACGACGGCCGCGCGATGCAGCACCCGCCC 
GAGGGGACCGTCGCGCGCGGGCGCGTCACCGGGCCGCCCGGCTATCTCCA 
GGGCGTCCTCGACGGGGCGTACGTCACGGAGGTGCCGCTCTCGCTCACGG 
TCGAGCTCGTGCAGCGCGGCCGGCAGCGCTTCGAGACCTTCTGCGCGCCG 

20 

1 0 TGCCACGGGATCCTCGGCGACGGCAGCTCGCGCGTGGCGACGAACATGAC 
GCTGCGCCCGCCGCCGTCGCTCGTCGGACCCGAGGCGCGGAGCTTCCCGC 
CGGGCAGGATCTACCAGGTCATCATCGAGGGCTACGGCCTGATGCCGCGC 
25 TACTCGGACGATCTGCCCGACATCGAAGAGCGCTGGGCCGTCGTCGCCTA 
CGTGAAGGCGCTTCAGCTGAGCCGCGGAGTGGCCGCGGGCGCCCT.CCCGC 

1 5 CCGCGCTCCGCGGCCGGGCAGAGCAGGAGCTGCGATGAACAGGGATGCCA 
TCTAGTACAAGGGCGGCGCGACGATCGCGGCCTCGCTCGCGATCGCGGCG 
CTCGGCGCGGTCGCCGCGATCGTCGGCGGCTTCGTCGATCTCCGCCGGTT 
CTTCTTCTCGTACCTCGCCGCGTGGTCGTTCGCGGTATTCCTGTCCGTGG 
GCGCGCTCGTCACGCTCCTCACCTGCAACGCCATGCGCGCGGGCTGGCCC 

20 ACGGCGGTGCGCCGCCTCCTCGAGACGATGGTGGCGCCGCTGCCCCTGCT 
CGCGGCGCTCTCCGCGCCGATCCTGGTCGGCCTGGACACGCTGTACCCGT 
GGATGCACCCCGAGCGGATCGCCGGCGAGCACGCGCGGCGCATCCTCGAG 
CACAGAACGCCCTACTTCAATCCAGGCTTCTTCGTCGTGCGCTCGGCGAT 
40 CTACTTCGCGATCTGGATCGCCGTCGCCCTCGTGCTCCGCGGGCGATCGT 

2 5 TCGC3CAGGACCGTGAGCCGAGGGCCGACGTCAAGGACGCGATGTATGGC 
CTGAGCGGCGCCATGCTGCCGGTCGTGGCGATCACGATCGTCTTCTCGTC 
GTTCGACTGGCTCATGTCCCTCGACGCGACCTGGTACTCGACGATGTTCC 
''^ CGGTCTACGTGTTCGCGAGCGCCTTCGTGACCGCCGTCGGCGCGCTCACG 
GTCCrCTCGTATGCCGCGCAGACGTCCGGTTACCTCGCGAGGCTCAACGA 

30 CTCGCACTATTACGCGCTCGGGCGGCTGCTCCTCGCGTTCACGATATTCT 
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GGGCCTATGCGGCCTATTTCCAGTTCATGTTGATCTGGATCGCGAACAAG 

CCCGACGAGGTCGCCTTCTTCCTCGACCGCTGGGAAGGGCCCTGGCGGCC 

GACCTCCGTGCTCGTCGTCCTCACGCGGTTCGTCGTCCCGTTCCTGATCC 

TGATGTCGTACGCGATCAAGCGGCGCCCGCGCCAGCTCTCGTGGATGGCG 

CTCTGGGTCGTCGCCTCCGGCTACATCGACTTTCACTGGCTCGTGGTGCC 

GGCGACAGGGCGCCACGGGTTCGCCTATCACTGGCTCGACCTCGCGACCC 

TGTGCGTCGTGGGCGGCCTCTCGACCGCGTTCGCCGCGTGGCGGCTGCGA 

GGGCGGCCGGTGGTCCCGGTCCACGACCCGCGGCTCGAAGAGGCCTTTGC 

GTACCGGAGCATATGATGTTCCGTTTCCGTCACAGCGAGGTTCGCCAGGA 

GGAGGACACGCTCCCCTGGGGGCGCGTGATCCTCGCGTTCGCCGTCGTGC 

TCGCGATCGGCGGCGCGCTGACGCTCTGGGCCTGGCTCGCGATGCGGGCC 

CGCGAGGCGGATCTGCGGCCCTCCCTCGCGTTCCCCGAGAAGGATCTCGG 

GCCGCGGCGCGAGGTCAGCATGGTCCAGCAGTCGCTGTTCGACGAGGCGC 

GCCTGGGCCAGCAGCTCGTCGAGGCGCAGCGCGCGGAGCTCCGCCGCTTC 

GGCGTCGTCGATCGGGAGAGGGGCATCGTGAGCATCCCGATCGACGACGC 

GATCGAGCTCATGGTGGCGGAGGGCGCGCGATGAGCCGGGCCGTCGCCGT 

GGCCCTCCTGCTGGCAGCCGGCCTCGTGTCGCGCCCGGGCGCCGCGTCCG 

AGCCCGTATCGCTTTCGCCCCGCGCTGGGCCCGTCCGCGGGCGAGGCCGC 

GCTCTGAAACGACGGCTCCGGCGCGGATGAGCGGCCCGAGGCGACCTCCT 

GCAACCCACCGCGCTGCGTACAGGGGTAAATCAACTGCATTCCAGGATAC 

GGACCGCGCAGCATAAACCCTCACCGGACAGACTGAATAATGCCGAACCT 

TGAACTTCTATGCCCATGCGTGCGGGATCATCATGCCCAACTATTTAAAC 

TGGTCCCTCCGGCAAAAGGAACGGACCAGCACCCAGAATAACCCCTGTTT 

GGCCAGCGCAAAAAAATGAAAACTCTTCTCCTTGCGGCTAAAATAAACGA 

CTCCGGGGAACCGAATGGTATGCAATACACCATGGCAACGCGATTCGTCC 

TAACCTTAAACAAACTTCCCAGCGACTCGTCCGTCGAACGCTTCTGACGC 

AAGACCATGACCCCACGAGAACCGGGCGGCGGACACACTGCCAGTGAAAC 

TCGGCCTAGGCCCGCCCTGCCTTCACGTATTCACCGTGGGGCGGTCCAAA 

ACTAAAAACAATACGTGACTCTTCTCAAATATCGTCGGATAAAGGCCAAC 

ACGCGTACCTCCCCCTAAAGGGAAGAAACCCCTACCAGGGTGGACCGTAT 
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CCACCCGTGATCCCTGAAACATCTCCAGCCGCGTACAAATTAGGCTTTGA 
CAAAACC 

SEQ ID No 80 (>Contiq55) 

GGGGAAAGAAGTCGGGAGCAGCAGAAGAGCGCAGCGATTGAAAGCACGAC 
GCAAGGGCAAAGGACATNAGACGACAACAGAAAGGCGAAGACATGAGAAG 
AGGAGAAAGACTCGGAAATGGGCACAAAGCGAGAAAAAACGTAACGCTAT 
CTGAGAAAGAACACCATAAAGTCAGACTGGGGTAAAGCCATACACGCAGC 
GAAAGCAGACAAGCAAAAGTCATTAACAGATGCAACAAAGAACAAAAGAG 
AAAGGAAGGACACATGAAGGAGAAAGGCGGCTTCGAGAACTAAAGACGGG 
AGCCAACAATATACCTTTCACAGGGGCGAAGAAGGGCCCAAGGTTCAATC 
GAAGACGATTGAATCCAAGGAAGTCCAATCGGAAGAAAAAAGCATATTGA 
TAAACCAGACAACGACAGCAGGCCTGAACGTAGGAGCGAGATCGTGAGAC 
ATCAGTAGGCAAAACAAGAGCGCTACACCCAGGGGTCGTCAACCTAGAAA 
GGCGCGTCCTCAAGCCGGTAGCGGCCGCGCGCGACCAGCCCGATGCGGGC 
GCCTTCGCGCGCGAACCGCCGCACCGTCGCGCGCCCCACGCCCGCTGACG 
CCCCCGTGATCACCACGACCTCCGGCCGCCTGGAATCGCCCATGCCCGTC 
GCCTCCCGCCTCGCCCGCGGCGTCAAGCAACGTGAATGCCACCTGAGCGT 
GTCACTTCCTCAAGCTCGACAGCACGTCCTTGATCCGCTCGGTCGGACCC 
GCCGTGAGCGACCAGGGGCTCGGCTGCCGCACGACGTGCAGGTCGCCTCG 
ACGCTCGAGCACCTCGAACCGCGTCGTTCCCTGCTCGGTCCGGACGAACC 
GGAGCGCCACCACGGCCGTGCCCACATGGAGGTTCGAGAGGGTGATCTCG 
GGCAGCCACGCCGGCAGCGCCGGGTCGACCAGCAGCAGGTCGAGCGGCGC 
GAACGGGTACAGCCCGAGCATCGCCTGGAGCATCGCGAACACGCTCGAGC 
AGGACCAGGCCTGCGGCCAGTTGGCCTTCGGGTACAGCGCGGGGAACGGG 
TGCTCCGCGTCGCGCGGGTGGCCGCTCCAGCACTCGGGCAGCCGGTGGTG 
CTCGAACAGCGCCGCCGCCTCGAACACGGCGCGGCACAGGAGCGCCACGT 
GCCCGTGCAGGCCGTACCGCGCGAGCCCGAGCGCGATCGCGCCCTGATCG 
ACGGGCCAGACCGTCCCCCGGTGATAGCTGTACGGATCGAACGGGGGGTG 
ATCGGCCGAGAGCGTGCGGATACCCCAGCCGGAGAACATGTCCGCCGCGA 
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160 



10 



ACATGCGGATGGCGGTGGGCTCGGCGAGGGCCTGGTCCACGATCCCCGCC 
GCGAGGCAGAGCCCCGGATCCGAGCCGATCGAGCGGATCTGGCGCTTGTC 
CGGGCCGAGGCCCATCGCGAAGGTGCGCGCGTCGGGCATCCAGAAGGCGT 
CGTTGAACCGCCTCTGCAGCTCGAGCGCCTCGGCGAACAGCCGTCGCGCG 
TCGTCCTTGCGGCCGAACCAGAAGAGCAGCTCGGAGAGGCGCAGCTTCGA 
15 CAGGAACACGAAGCCCTGCATCTCGCACGTCCCGATCGGCGGCCGCACCT 
GAGAGCCGTCGGCGTGGACGATGGCGTCGTCGGAGTCCTTCCAGCCCTGG 
TTCTGGATCGACGCGCTCGAGCGGGGCTCGTACTCGTAGAACCCGTCGCC 
GTCGAGATCGCCCTCCTCGTCGATCCAGCGCATGGCCCTGAGCGCAGGCT 
10 CGATCAGGCGGCCGACGCGCTCGCGATCGCCGGTCCAGTGCCAGAGCTCC 
GAGGCGGCCACCGCGTAGAACATCGTCGAGGTCGCCGACGCGTACGTGCG 
CCCCAGCGGGTTGTAGTTGAGGTCCGAGAGCGCTCCGTCCCTGGCCTGAT 
GCAGCATCCGGTCGGGCTGCTCGTCGCGCCAGTCGTCGACGACGCGCCCC 
TGCCAGCGCGGGAGCACGAGCGC7GTGCCTGCGAGGATGTCGGTCGTCAG 
15 CGCGGCGGCCTGCGTGCCCGCGGCGAGCGGGTCGCGGCCGAAGAGCCCGA 
TGTAGATCGGCAGTCCGGCGGCCACCGTCCAGGAGCGCTCGTCCTGGTCG 
ATATCGTACATGCGGAGCGCGATGAGATCGCGCTTGGCTCGCTCGAGCAC 
GGAGAAGACCGTATGCGAGAGCGTGTCGGCGCCCGGGACCGAGAAGGACG 
TGGCGCGGTGGTGGAAGGTGCCGCGCGCCACGTCGCGCGCGTTTCGCGTG 
CCGAAGAACGAGCGGCAGCCGGCGAGCAGCGGGAGCGGTTCGCCGCGGAT ■ 
CAGGGCGATCACGTCGACGCAGCCGCGCCAGGTGCCATGGGGCTCGAGAT 
CGATCGAGAACCGGATCTCGCGGCCAGCGCAGGACGGGGGCGAGCCCGCG 
CTCCGGGCCCGTACGACGATCCCGGCGTCGAAGCGCGCGACGCCGGATTC 
40 TCCGGGGTGTTCATAGCGGTGCTCGGCGCGGTAGTCGCAGCGCAGCGCGC 

2 5 ATCCGTGCTCGGCGGGCTCGAGCG CCCACCGGGCGTCGCCGCGCTGGAGG 

CGCGGGCCGTCCGTCTCCTCCGCGTCGGCGAAGTCGGCGTCGATCTCGAG 
CGCCAGCGTGAAGCGCACGCGCTCCTGCGTGAAGCTCGCGACGTCGATGT 
CCTCGTGGAAGCCGTCCCCCACGAAGCGCGACAGCCGCAGCTCGACGGTC 
CGCGGCGCCGCGCCGCCGCCCTCGGGCGGCGGGGCGATGTAGTAGCCGAG 

3 0 CCAGCTGTCCGGCTCGACGGCGGAGAGCGCGACGGGCGCCGGCGGCCTGC 
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CGTCGATCAGGTGGCGGTAGAGGGAGAGGAGGCGGGTGTTGCGCACGAAC 

AGGCCGATGTGCGGCTCGGGGGCGATCGAGCCGTCCGGACGCATGCACAG 

GACCGTCCGGTTCTGGCTGATGTACAGGGAGCCGGCGCGGGGCCTCAGCG 

TGGCCAGCGAGCCGAAGGGTCTCTCGAGCGACATGGTCACCTCCAGGCGA 

GCCCGGGGTGAGGACTGCCAGGGGCGTGCCAACGACGTGGACGCGCTCAC 

GCCAAGCCGCTGGGCGGGCGCGGCGGCCCCTGCGCCCGAGGCTCAGCCGA 

GCGCGAGCGGTGTTTCGTCGGCGCGAGCGGGCCTCGTCGTCGCCTCCAGG 

TAGGCCTTTCCGATGTCGGCCTCGTCCACGAACCCGACGATCGCGCCCTC 

GCCGTCGACGACGGGGACCTCGCGGACGCCGTGCGCCACCATCGCCTCGG 

TCGCCGTCCGCAGATCGTCGGTGACCGTCACGGCCACCC^CGGCTGCATC 

GCGTCGGCGGCCACGGTCATCCGCTCGAGGTCGTGCTCCACCGCGATGAT 

CCGGAGCGACTCGGCGGTGATCATGCCGACCATCTTGCGCGACGGTTCGA 

GCACCGGGAACACCTCCTGCCAGCTCGCGTCGGCCGCCCGCCGGAGCATC 

TCGCGGGCCGGCGTCCCCGGCACGAACGTCACGAGCGCGCGCCCCTCGAT 

CATGATCTGCCGCACGCGGATGGTCTTGAGCACGTCGAGCGTCGGCACCG 

GGTGCGCGGGAGACTCGCGCTGGGTGGGGAGCTGCGCGTGGTAGAGCGAG 

TGCTTCCGCAGCGCGACGAAGGCGACGCCCTCGGCGAGCATCAGCGGAAC 

CAGGAGGTCGTAGCTGCCGGCGAGCTCGCAGACCATCACGAGGGAGCTCA 

CCGGCACGTGCGCGACGCCGCCGTAGAAGGTGCCCATGCCCACGAGCGCG 

AAGGCGCCCGGGTCGATGCGCGGATCGCCGAGCAGGAGCGCCGCCGCGCG 

CCCGAACGCGCCGCCGAAGAGCCCGCCGATGACGAGCGACGGCGCGAAGT 

CGCCGGCGCACCCGCCGCTGCCGAGCGTGAGCGACGAGGCGACGATCTTG 

GCC-GCGCAGAGCAGGAGCAGGAGCTCCACGCCGCGCCAGCCCGGGTGGAG 

CCACGTGGCGCCGGTGATCGCGACCTGGACGGCGCCGTACCCGCCGCCGA 

GCAGCCCGAGCCCTTGCCCGGGGCTCTCGATCCTGCGGCCGACGAACCAG 

AGGACGGGCACGCAGAAGAGGCCCAGCGCGAGCCCGCCGAGCCCCGGGCG 

CGCCCAGGGGGCGATGGGCAGGCGCGCCGCGATCGCCTTCACGCCGCCGA 

GGCACTTCAGGAAGCCGATCGCGACGAAGGCCAGGAGCAGCGCGAGCAGC 

GCATAGAGAGGGAGGTGCGACGGGACGAACGCATACTTCGGCGCGTGCGC 

GAii.GAGCGTCGACTCGCCGTAGAACGAGATGAAGACCGAGTAGGAGACCA 
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CGCTGGCGAGCAGCGCCGGGATCAGCGCCTCGGCCTCGAAGTCGTCGCGG 
TAGAGCACCTCGACGGCGAGCAGGGCGGCGCCGAGCGGCGTGCGGAAGAT 
GGCGGACATCCCGGCCGCGACCCCCGCGAGCATCAGGATGCGGTGCTCGC 
GCCGGCCGACCGCGAGCCCGCGCCCCACGAGCGAGCCGAGCGCTCCTCCG 
5 ACCTGCATGGTCGGCCCCTCGCGACCGCCGGCGCCGCCCGAGCCGAGCGT 
^5 GAGGATCGACGCGACCGCCTTGACCCACGCGACCCGCTTGCGCATCCGGC 
CGCCGTGGTGGTGGAAGGCCTGGATCATCGCGTCGCCGCCGCCGCCCGCG 
GCCTCGGGGGCGAGGCGCCAGGTGAGGATGCCCCCGGCCAGCGCTCCGAG 
CGCCGGGATCAGCAGCAGCAGCCAGAGCCGGACGCTCCGGTGCTCGGGGC 

20 

10 CGTCGCCGCTGAAGATGGCCTCGCCGTGGGCTCGAAGCCGCGCGTAGCCG 
GCGAGGCGGCCGAGGAGCAGCTCCTCGACGAGCTCGAGGGCGCCGAAGAA 
GAGCACCGCGACGAGGCCCGCGATGGCCCCGACGAGCACGGCATGCAGGA 
25 TCGTGCGCCCGACGAGCCTGAGATCGAGGGGCGCGACCTCCGAGAGGAGC 
GCGGAGAAGGGGCGCCGCCGTCGCACCACGCCGGCGCGCTCTGCTGGGAA 

1 5 TGGTTCCGTCACGGCGATGGT 

30 SEQ ID No 81 {>Contig56) 

GGATCCGGCCGCGAGCGGCTGCGGTGCCGATGCGCTCGACGGTGACGGGC 
GGGGTGATCGCGGGTCCGGAGCTCGGTGCGAGCTACTGGGCGGACAACCT 
20 TCGGCAGCCGGTGCGCTTCGCTGCGGCGGCGCAAGCGCTGCTGGAGGGTG 

35 

GCCCCGCGCTGTTCATCGAGATGAGCCCGCACCCGATCCTGGTGCCGCCC 
CTGGACGAGATCCAGACGGCGGCCGAGCAAGGGGGCGCTGCGGTGGGCTC 
GCTGCGGCGAGGGCAGGACGAGCGCGCGACGCTGCTGGAGGCGCTGGGGA 
^ CGCTGTGGGCGTCCGGCTATCCGGTGAGCTGGGCTCGGCTGTTCCCCGCG 

2 5 GGCGGCAGGCGGGTTCCGCTGCCGACCTATCCCTGGCAGCACGAGCGGTG 

CTGGATCGAGGTCGAGCCTQACGeCCGCCGCCTCGCCGCAGCCGACCCCA 
CCAAGGACTGGTTCTACCGAACGGACTGGCCCGAGGTGCCCCGCGCCGCC 
''^ CCGAAATCGGAGACAGCTCATGGGAGCTGGCTGCTGTTGGCCGACAGGGG 
TGGGGTCGGTGAGGCGGTCGCTGCAGCGCTGTCGACGCGCGGACTTTCCT 

3 0 GCACCGTGCTTCATGCGTCGGCTGACGCCTCCACCGTCGCCGAGCAGGTA 
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TCCGAAGCTGCCAGTCGCCGAAACGACTGGCAGGGAGTCCTCTACCTGTG 
GGGCCTCGACGCCGTCGTCGATGCTGGGGCATCGGCCGACGAAGTCAGCG 
AGGCTACCCGCCGTGCCACCGCACCCGTCCTTGGGCTGGTTCGATTCCTG 
AGCGCTGCGCCCCATCCTCCTCGCTTCTGGGTGGTGACCCGCGGGGCATG 
5 CACGGTGGGCGGCGAGCCAGAGGCCTCTCTTTGCCAAGCGGCGTTGTGGG 
^5 GCCTCGCGCGCGTCGCGGCGCTGGAGCACCCCGCTGCCTGGGGTGGCCTC 
GTGGACCTGGATCCTCAGAAGAGCCCGACGGAGATCGAGCCCCTGGTGGC 
CGAGCTGCTTTCGCCGGACGCCGAGGATCAACTGGCGTTGCGCAGCGGTC 
GCAGGCACGCAGCACGCCTTGTAGCCGCCCCGCCGGAGGGCGACGTCGCA 

20 

1 0 CCGATATCGCTGTCCGCGGAGGGGAGCTACCTGGTGACGGGCGGGCTGGG 
TGGCCTTGGTCTGCTCGTGGCTCGGTGGCTGGTGGAGCGGGGAGCTCGAC 
ATCTGGTGCTCACCAGCCGGCACGGGCTGCCAGAGCGACAGGCGTCGGGC 
25 GGAGAGCAGCCGCCGGAGGCCCGCGCGCGCATCGCAGCGGTCGAGGGGCT 
GGAAGCGCAGGGCGCGCGGGTGACCGTGGCAGCGGTGGATGTCGCCGAGG 
15 CCGATCCCATGACGGCGCTGCTGGCCGCCATCGAGCCCCCGTTGCGCGGG 
GTGGTGCACGCCGCCGGCGTCTTCC CCGTGCGTCACCTGGCGGAGACGGA 
^0 CGAGGCCCTGCTGGAGTCGGTGCTCCGTCCCAAGGTGGCCGGGAGCTGGC 
TGCTGCACCGGCTGCTGCGCGACCGGCCTCTCGACCTGTTCGTGCTGTTC 
TCGTCGGGCGCGGCGGTGTGGGGTGGCAAAGGCCAAGGCGCATACGCCGC 
20 GGCCAATGCGTTCCTCGACGGGCTCGCGCACCATCGCCGCGCGCACTCGC 
TGCCGGCGTTGAGCCTCGCCTGGGGCTTATGGGCCGAGGGAGGCATGGTT 
GATGCAAAGGCTCATGCACGTCTGAGCGACATCGGGGTCCTGCCCATGGC 
CACGGGGCCGGCCTTGTCGGCGCTGGAGCGCCTGGTGAACACCAGCGCTG 
40 TCCAGCGTTCGGTCACACGGATGGACTGGGCGCGCTTCGCGCCGGTCTAT 

2 5 GCCGCGCQAGGGCGGCGCAACTTGCTTTCGGCTCTGGTCGCGGAGGACGA 

GCGCGCTGCGTCTCCCCCGGTGCCGACGGCAAACCGGATCTGGCGCGGCC 
TGTCCGTTGCGGAGAGCCGCTCAGCCCTCTACGAGCTCGTTCGCGGCATC 
GTCGCCCGGGTGCTGGGCTTCTCCGACCCGGGCGCGCTCGACGTCGGCCG 
AGGCTTCGCCGAGCAGGGGCTCGACTCCCTGATGGCTCTGGAGATCCGTA 

3 0 ACCGCCTTCAGCGCGAGCTGGGCGAACGGCTGTCGGCGACTCTGGCCTTC 
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GACCACCCGACGGTGGAGCGGCTC-3TGGCGCATCTCCTCACCGACGTGCT 
GAAGCTGGAGGACCGGAGCGACACCCGGCACATCCGGTCGGTGGCGGCGG 
ATGACGACATCGCCATCGTCGGTGCCGCCTGCCGGTTCCCAGGTGGGGAT 
GAGGGCCTGGAGACATACTGGCGGCATCTGGCCGAGGGCATGGTGGTCAG 
5 CACCGAGGTGCCAGCCGACCGGTG3CGCGCGGCGGACTGGTACGACCCCG 
15 ATCCGGAGGTTCCGGGCCGGACCTATGTGGCCAAGGGTGCCTTCCTCCGC 
GATGTGCGCAGCTTGGATGCGGCGTTCTTCGCCATTTCCCCTCGTGAGGC 
GATGAGCCTGGACCCGCAACAGCGGCTGTTGCTGGAGGTGAGCTGGGAGG 
CGATCGAGCGCGCTGGCCAGGACCCGATGGCGCTGCGCGAGAGCGCCACG 

20 

10 GGCGTGTTCGTGGGCATGATCGGGAGCGAGCACGCCGAGCGGGTGCAGGG 
CCTCGACGACGACGCGGCGTTGCTGTACGGCACCACCGGCAACCTGCTCA 
GCGTCGCCGCTGGACGGCTGTCGTTCTTCCTGGGTCTGCACGGCCCGACG 
25 ATGACGGTGGACACCGCCTGCTCGTCGTCGCTGGTGGCGTTGCACCTCGC 
CTGCCAGAGCCTGCGATTGGGCGAGTGCGACCAGGCCCTGGCCGGCGGGT 
15 CCAGCGTGCTTTTGTCGCCGCGG7CATTCGTCGCGGCGTCGCGCATGCGT 
TTGCTTTCGCCAGATGGGCGGTGCAAGACGTTCTCGGCCGCTGCAGACGG 
30 CTTTGCGCGGGCCGAGGGCTGCGCCGTGGTGGTGCTCAAGCGGCTCCGTG 
ACGCGCAGCGCGACCGCGACCCCATCCTGGCGGTGGTCAGGAGCACGGCG 
ATCAACCACGATGGCCCGAGCAGCGGGCTCACGGTGCCCAGCGGTCCTGC 
20 CCAGCAGGCGTTGCTACGCCAGGC3CTGGCGCAAGCGGGCGTGGCGCCGG 
CCGAGGTCGATTTCGTGGAGTGCCACGGGACGGGGACAGCGCTGGGTGAC 
CCGATCGAGGTGCAGGCGCTGGGCGCGGTGTACGGGCGGGGCCGCCCCGC 
GGAGCGGCCGCTCTGGCTGGGCGCTGTCAAGGCCAACCTCGGCCACCTGG 
AGGCCGCGGCGGGCTTGGCCGGCGTGCTCAAGGTGCTCTTGGCGCTGGAG 

2 5 CACGAGCAGATTCCGGCTCAACCGGAGCTCGACGAGCTCAACCCGCACAT 

CCCGTGGGCAGAGCTGCCAGTGGCCGTTGTCCGCAGGGCGGTCCCCTGGC 
CGCGCGGCGCGCGCCCGCGTCGTG CAGGCGTGAGCGCTTTCGGCCTGAGC 
GGGACCAACGCGCATGTGGTGTTGGAGGAGGCGCCGGCGGTGGAGCCTGT 
GGCCGCGGCCCCCGAGCGCGCAGCGGAGCTGTTCGTCCTGTCGGCGAAGA 

3 0 GCG CGGCGGCGCTGGATG CGCAG G CAGCCCGGCTGCGGGACCACCTGGAG 
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AAGCATGTCGAGCTTGGCCTCGGCGATGTGGCGTTCAGCCTGGCGACGAC 

GCGCAGCGCGATGGAGCACCGGCTGGCGGTGGCCGCGAGCTCGCGCGAGG 

CGCTGCGAGGGGCGCTTTCGGCCGCAGCGCAGGGGCACACGCCGCCGGGA 

GCCGTGCGTGGGCGGGCCTCGGGCGGCAGCGCGCCGAAGGTGGTCTTCGT 

GTTTCCCGGCCAGGGCTCGCAGTGGGTGGGCATGGGCCGAAAGCTCATGG 

CCGAAGAGCCGGTCTTCCGGGCGGCGCTGGAGGGTTGCGACCGGGCCATC 

GAGGCGGAAGCGGGCTGGTCGCTGCTCGGGGAGCTCTCCGCCGACGAGGC 

CGCCTCGCAGCTCGGGCGCATCGACGTGGTTCAGCCGGTGCTGTTCGCCA 

TGGAAGTAGCGCTTTCTGCGCTGTGGCGGTCGTGGGGAGTGGAGCCGGAA 

GCGGTGGTGGGCCACAGCATGGGCGAGGTTGCGGCGGCGCACGTGGCCGG 

CGCGCTGTCGCTCGAGGACGCGGTGGCGATCATCTGCCGGCGCAGCCGGC 

TGCTGCGGCGGATCAGCGGTCAGGGGGAGATGGCGCTGGTCGAGCTGTCG 

CTGGAGGAGGCCGAGGCGGCGCTGCGTGGCCATGAGGGTCGGCTGAGCGT 

GGCGGTGAGCAACAGCCCGCGCTCGACCGTGCTCGCCGGCGAGCCGGCGG 

CGCTCTCGGAGGTGCTGGCGGCGCTGACGGCCAAGGGGGTGTTCTC'GCGG 

CAGGTGAAGGTGGACGTCGCCAGCCATAGCCCGCAGGTCGACCCGCTGCG 

CGAAGAGCTGATCGCGGCGCTGGGAGCGATCCGGCCGCGAGCGGCTGCGG 

TGCCGATGCGCTCGACGGTGACGGGCGGGGTGATCGCGGGTCCGGAGCTC 

GGTGCGAGCTACTGGGCGGACAACCTTCGGCAGCCGGTGCGCTTCGCTGC 

GGCGGCGCAAGCGCTGCTGGAGGGTGGCCCCGCGCTGTTCATCGAGATGA 

GCCCGCACCCGATCCTGGTGCCGCCCCTGGACGAGATCCAGACGGCGGCC 

GAGCAAGGGGGCGCTGCGGTGGGCTCGCTGCGGCGAGGGCAGGACGAGCG 

CGCGACGCTGCTGGAGGCGCTGGGGACGCTGTGGGCGTCCGGCTATCCGG 

TGAGCTGGGCTCGGCTGTTCCCCGCGGGCGGCAGGCGGGTTCCGCTGCCG 

ACCTATCCCTGGCAGCACGAGCGGTACTGGATCGAGGACAGCGTGCATGG 

GTCGAAGCCCTCGCTGCGGCTTCGGCAGCTTCGCAACGGCGCCACGGACC 

ATCCGCTGCTCGGGGCTCCATTGCTCGTCTCGGCGCGACCCGGAGCTCAC 

TTGTGGGAGCAAGCGCTGAGCGACGAGAGGCTATCCTACCTTTCGGAACA 

TAGGGTCCATGGCGAAGCCGTGTTGCCCAGCGCGGCGTATGTAGAGATGG 

CGCTCGCCGCCGGCGTAGATCTCTATGGCACGGCGACGCTGGTGCTGGAG 
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CAGCTGGCGCTCGAGCGAGCCCTCGCCGTGCCCTCCGAAGGCGGACGCAT 
CGTGCAAGTGGCCCTCAGCGAAGAAGGTCCCGGTCGGGCCTCATTCCAGG 
TATCGAGTCGTGAGGAGGCAGGTAGGAGCTGGGTGCGGCACGCCACGGGG 
CACGTGTGTAGCGGCCAGAGCTCAGCGGTGGGAGCGTTGAAGGAAGCTCC 
5 GTGGGAGATTCAACGGCGATGTCCGAGCGTCCTGTCGTCGGAGGCGCTCT 
15 ATCCGCTGCTCAACGAGCACGCCCTCGACTATGGTCCCTGCTTCCAGGGC 
GTGGAGCACGTGTGGCTCGGCACGGGGGAGGTGCTCGGCCGGGTACGCTT 
GCCAGGAGACATGGCATCCTCAAGTGGCGCCTACCGGATTCATCCCGCCT 
TGTTGGATGCATGTTTTCAGGTGCTGACAGCGCTGCTCACCACGCCGGAA 
2^ 1 0 TCCATCGAGATTCGGAGGCGGCTGACGGATCTCCACGAACCGGATCTCCC 

• GCGGTCCAGGGCTCCGGTGAATCAAGCGGTGAGTGACACCTGGCTGTGGG 
ACGCCGCGCTGGACGGTGGACGGCGCCAGAGCGCGAGCGTGCCCGTCGAC 
2^ CTGGTGCTCGGCAGCTTCCATGCGAAGTGGGAGGTCATGGAGCGCCTCGC 
GCAGGCGTACATCATCGGCACTCTCCGCATATGGAACGTCTTCTGCGCTG 
1 5 CTGGAGAGCGTCACACGATAGACGAGTTGCTCGTCAGGCTTCAAATCTCT 
GTCGTCTACAGGAAGGTCATCAAGCGATGGATGGAACACCTTGTCGCGAT 
30 CGGCATCCTTGTAGGGGACGGAGAGCATTTTGTGAGCTCTCAGCCGCTGC 
CGGAGCCTGATTTGGCGGCGGTGCTCGAGGAGGCCGGGAGGGTGTTCGCC 
GACCTCCCAGTCCTATTTGAGTGG7GCAAGTTTGCCGGGGAACGGCTCGC 
20 GGACGTATTGACCGGTAAGACGCTCGCGCTCGAGATCCTCTTCCCTGGTG 
GCTCGTTCGATATGGCGGAGCGAATCTATCGAGATTCGCCCATCGCCCGT 
TACTCGAACGGCATCGTGCGCGGTGTCGTCGAGTCGGCGGCGCGGGTGGT 
AGCACCGTCGGGAATGTTCAGCATCTTGGAGATCGGAGCAGGGACGGGCG 
^ CGACCACCGCCGCCGTCCTCCCGGTGTTGCTGCCTGACCGGACGGAGTAC 

2 5 CATTTCACCGATGTTTCTCCGCTCTTCCTTGCTCGCGCGGAGCAAAGATT 

TCGAGATTATCCATTCCTGAAGTATGGCATTCTGGATGTCGACCAGGAGC 
CAGCTGGCCAGGGATACGCACATCAGAGGTTTGACGTCATCGTCGCGGCC 
45 AATGTCATCCATGCGACCCGCGATATAAGAGCCACGGCGAAGCGTCTCCT 
GTCGTTGCTCGCGCCCGGAGGCCTTCTGGTGCTGGTCGAGGGCACAGGGC 

3 0 ATCCGATCTGGTTCGATATCACCACGGGATTGATTGAGGGGTGGCAGAAG 
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TACGAAGATGATCTTCGTATCGACCATCCGCTCCTGCCTGCTCGGACCTG 
GTGTGACGTCCTGCGCCGGGTAGGCTTTGCGGACGCCGTGAGTCTGCCAG 
GCGACGGATCTCCGGCGGGGATCCTCGGACAGCACGTGATCCTCTCGCGC 
GCGCCGGGCATAGCAGGAGCCGCTTGTGACAGCTCCGGTGAGTCGGCGAC 
5 CGAATCGCCGGCCGCGCGTGCAGTACGGCAGGAATGGGCCGATGGCTCCG 
15 CTGACGTCGTCCATCGGATGGCGTTGGAGAGGATGTACTTCCACCGCCGG 
CCGGGCCGGCAGGTTTGGGTCCACGGTCGATTGCGTACCGGTGGAGGCGC 
GTTCACGAAGGCGCTCGCTGGAGATCTGCTCCTGTTCGAAGACACCGGGC 
AGGTCGTGGCAGAGGTTCAGGGGCrCCGCCTGCCGCAGCTCGAGGCTTCT 
1 0 GCTTTCGCGCCGCGGGACCCGCGGGAAGAGTGGTTGTACGCTTTGGAATG 
GCAGCGCAAAGACCCTATACCAGAG3CTCCGGCAGCCGCGTCTTCTTCCT 
CCGCGGGGGCTTGGCTCGTGCTGArGGACCAGGGCGGGACAGGCGCTGCG 
CTCGTATCGCTGCTGGAAGGGCGAG3CGAGGCGTGCGTGCGCGTCATCGC 
GGGTACGGCATACGCCTGCCTCGCGCCGGGGCTGTATCAAGTCGATCCGG 
1 5 CGCAGCCAGATGGCTTTCATACCCTGCTCCGCGATGCATTCGGCGAGGAC 
CGGATTTGTCGCGCGGTAGTGCATATGTGGAGCCTTGATGCGACGGCAGC 
30 AGGGGAGAGGGCGACAGCGGAGTCGCTTCAGGCCGATCAACTCCTGGGGA 
GCCTGAGCGCGCTTTCTCTGGTGCAGGCGCTGGTGCGCCGGAGGTGGCGC 
AACATGCCGCGGCTTTGGCTCTTGACCCGCGCCGTGCATGCGGTGGGCGC 
20 GGAGGACGCAGCGGCCTCGGTGGCGCAGGCGCCGGTGTGGGGCCTCGGTC 
GGACGCTCGCGCTCGAGCATCCAGAGCTGCGGTGCACGCTCGTGGACGTG 
AACCCGGCGCCGTCTCCAGAGGACGCAGCCGCACTGGCGGTGGAGCTCGG 
GGCGAGCGACAGAGAGGACCAGGTCGCATTGCGCTCGGATGGCCGCTACG 
40 TGGCGCGCCTCGTGCGGAGCTCCTTTTCCGGCAAGCCTGCTACGGATTGC 
2 5 GGCATCCGGGCGGACGGCAGCTATGTGATCACCGATGGCATGGGGAGAGT 
GGGGCTCTCGGTCGCGCAATGGATGGTGATGCAGGGGGCCCGCCATGTGG 
TGCTCGTGGATCGCGGCGGCGCTTCCGAGGCATCCCGGGATGCCCTCCGG 
TCCATGGCCGAGGCTGGCGCGGAGGTGCAGATCGTGGAGGCCGACGTGGC 
TCGGCGCGACGATGTCGCTCGGCTCCTCTCGAAGATCGAACCGTCGATGC 
CGCCGCTTCGGGGGATCGTGTACC-rGGACGGGACCTTCCAGGGCGACTCC 
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TC3ATGCTGGAGCTGGATGCCCGTCGCTTCAAGGAGTGGATGTATCCCAA 
GGTGCTCGGAGCGTGGAACCTGCACGCGCTGACCAGGGATAGATCGCTGG 
ACTTCTTCGTCCTGTATTCCTCGGGCACCTCGCTTCTGGGCTTGCCAGGA 
CAGGGGAGCCGCGCCGCCGGTGACGCCTTCTTGGACGCCATCGCGCATCA 
5 CCGGTGCAAGGTGGGCCTTACAGCGATGAGCA7CAACTGGGGATTGCTCT 
15 CCGAAGCATCATCGCCGGCGACCCCGAACGACGGCGGAGCACGGCTCGAA 
TACCGGGGGATGGAAGGCCTCACGCTGGAGCAGGGAGCGGCGGCGCTCGG 
GCGCTTGCTCGCACGACCCAGGGCGCAGGTAGGGGTGATGCGGCTGAATC 
TGCGCCAGTGGTTGGAGTTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
a 0 XXXXTATGGTATAACTTATTGATTATAATACAGTATACAAAGGTCCCTTT 
TCAG3GACCCTTTCGTATGTTGTAGCTGATTTTATTTTCTTCTTTTCTTT 
TGGGTGCTTTTAATAGCCATATArGATGTTTCATATAAAGTTAAAAGTTT 
CCCCATCTGTCTATCTATAGCGTCATGTTTTTCAGGATTTCTTAATTTCT 
GCAGCAGTTTAGATATATATTTCGCAGGTAACTTTATTATTAATGAATTA 
15 TGGGATTGATACATTTCCTTTGTCCAATCTAAATCTTTTCCTGATTTCAA 
TACCTCACTCTCATCTCTATTAATTATAAAACCGAGCTTTCCATACGGAC 
30 CTGTTAAATATGATTGTAATTGTCTATACTCAGATGGACCTAATTCTTCA 
AAATTTTTAGCATCAAAAACAACTTGTCTTGTTTTATAGTCCTCCAATAC 
TCGTTTCCAAAAATCAGATTTGCCACCATTGGTGCCTATAATATCTCGTC 
2 0 TCTGAACTGCGTTACCATTTGGATGGGACTTGATGTCTGTTAGGTGGGAT 
GCAAATACTATTCTTAGTGCGTCTAAACACCATTGTTCAAATTCAGTGGC 
ACCTTCATTTCCTATCGGTATCTGATCTAGATGAGTAGTTATTTGACCTA 
TTGTTTTATTTCTAATGGCTGAATTATCTGAAATAATATTAATATCATAT 
40 TCGTCATTTATCTCCTCTGCTTCCTCTGGAGCAAGTGCATTACGATTTAG 

2 5 ATTCAAACCAAGCCAGTAACATGGATGAATTAATAATTTCTCATTACTTT 

CAAAACCTTTATCTGGAGTTCGCCCGTCATGGCAAAATGAATAGGATGAG 
GTGTTTTTATCACGAATACCAACAAATCCTACACTATACAAACTTTGGAG 
AATTCCACTTGCCTTTAACAACTGTATTTCAGACGTTATTTTAGGATCTC 
CATTTTCTTCGATTAATTCGAAAGATGCTTCTATTTTTTTTAAGCACGTA 

3 0 TAAACTGTTAATTCAGGTTCAATGCTACGAAATGCACTAGTTATAACCTG 
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TATTGAAGGAAAGATCTTCTGATACTCTTTCCAGAGATCTTCAAGTCTGG 
CCATGGAAATTGACTTGGCTGCATATTCTAGGTCAGTGTTTATGATAGTT 
TCTCTATTCTCTCTGAAT3CGGAAAAAAAAGCTTCATTCAACAATGATAG 
TAAATCCCTGGGCCGGTAAAGGGTAAATTGCAAACATCGCTTAAAACCAT 
5 TCCTCCCTTTAAGATCATCCGCTGTGCATCTATCCCAAACTCGTTGATCT 
^5 '^-TCAATATCTAGCTTAAATGCTACTTTCATTCTTTTAGCTGACAGCAT 
TAGGAGTTGTGCCCAGTCCCAATGCAACCTTATGACTTGACCCTCTATAT 
TTCTCGAGTAATCAGGATCTTCCTTTGATAGCGACCTAAATATATTATCC 
CTTAAAAAAATTATTGGACGAATGCATTTTGCTTTTTGATTTAATTCAAT 

20 

- 0 AGATGCATATGCTAGACCTGCAATGATTCCAATTCCTATATTATCCGGTT 
CATACGCCTCATCTAGCTTATCCATTAATATGACAACTTTCCTGTCTGAG 
CGTTCAAGAAGTGATACTATATTATTTTCTATTTCTGAGATATTCAAATT 
25 GAATTGAAGATCACCAATTGATTCTTCTGGGTTATTTTCATCTAAATACT 
CCTTTGCGACAAGCCTACACTTTCTTAAAATGTCACCTTGTGCAGAATTC 
1 5 CATTTTTTCAAATGTTCATTCAACAATGTTTCTGATGATATTTGAGATGA 
CAATTTGTAATGAGATGATATATATGATGCTATCTCCATTAGCATAGCGT 
30 ATCGCCATAATAGTCTTGTTGCTGCCCTTGCTAAATTAAATGATCCTGTA 
AATGGTTTCAACATTGATCTGAAACCAATAATTTGAGAATCGTCTGGTGA 
GAAACTCAGGATTAATATTTTTTTGTCTTTCTTCCAATGCTCATTTAGCT 
2 0 GAATAAATAAAGCACTTTTACCTGTCCCTCGTCTACCAACAACAATGGTC 

35 

CTGTCATCAGTTTCAATTAGAGTCCTAAAGTCAGCAGTTTCAATGAAAGC 
ATTACTCAACATCTTTTTATCATTTTCTGCTGTCGTATCACCAAACGGAT 
TAGACTTTGAAGTAATATTCAATTCCATATTCAACCTTTTATGTTAGTTG 
^ CTTTCATTTATTACTTTATATACTGTTGAACGAGCAATATTCATTGTTTT 

2 5 TGATATATGTGAGGCACCTAACCCCTGTTGCCACATATTTAATACTGCAT 

CTCTATCTATTTTTCTTTTTCTACCAAAAACAACTCCTTTTGCCA 

SEQ ID No 82 (>Contig57) 

TCATCTATTGTATAGTTTGTATATTGATATGATATAATTATAACATATAA 

3 0 ACAGTAAACTTTCTCTACGTAGATCGAGGAGAAGACTCAATTTGTTGACA 
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TCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACTACTCGCATACCGTTG 
iO CGCAACAGCGGCGCGAGGAGCAGGACGOVTACGACATCACCGGCAATACG 
CTCAGCGTCGCCGACGGACGGTTGTCTTATACGCTAGGGCTGCAGGGACC 
CTGCCTGACCGTCGACACGGTCTGCTCGTCGTCGCTCGTGGCCATCCACC 
5 TTGCCTGCCGCAGCCTGCGCGCTCGCGAGAGCGATCTCGCGCTGGCGGGA 
GGCGTCAACATGCTCCTTTCGTCCAAGACGATGATAATGCTGGGGCGCAT 
CCAGGCGCTGTCGCCCGATGGCCACTGCCGGACATTCGACGCCTCGGCCA 
ACGGGTTCGTCCGTGGGGAGGGCTGCGGTATGGTCGTGCTCAAACGGCTC 
TCCGACGCCCAGCGACACGGCGATCGGATCTGGGCTCTGATCCGGGGTTC 

20 

iC GGCCATGAATCAGGATGGCCGGTCGACAGGGTTGATGGCACCCAATGTGC 
TCGCTCAGGAGGCGCTCTTGCGCGAGGCGCTGCAGAGCGCTCGCGTCGAC 
GCCGGGGCCATCGGTTATGTCGAGACCCACGGAACGGGGACCTCGCTCGG 
25 CGACCCGATCGAGGTCGAGGCGCTGCGTGCCGTGTTGGGGCCGGCGCGGG 
CCGATGGGAGCCGCTGCGTGCTGGGCGCAGTGAAGACAAACCTCGGCCAC 

1 5 CTGGAGGGCGCTGCAGGCGTGGCGGGTTTGATCAAGGCGGCGCTGGCTCT 
GCACCACGAACTGATCCCGCGAAACCTCCATTTCCACACGCTCAATCCGC 
GGATCCGGATCGAGGGGACCGCGCTCGCGCTGGCGACGGAGCCGGTGCCG 
TGGCCGCGGGCGGGCCGACCGCGCTTCGCGGGGGTGAGCGCGTTCGGCCT 
CAGCGGCACCAACGTCCATGTCGTGCTGGAGGAGGCGCCGGCCACGGTGC 

20 TCGCACCGGCGACGCCGGGGCGCTCAGCGGAGCTTTTGGTGCTGTCGGCG 

35 

AAGAGCGCCGCCGCGCTGGACGCACAGGCGGCGCGGCTCTCAGCGCACAT 
CGCCGCGTACCCGGAGCAGGGTCTCGGAGACGTCGCGTTCAGCCTGGTAT 
CGACGCGTAGCCCGATGGAGCACCGGCTCGCGGTGGCGGCGACCTCGCGC 
40 GAGGCGCTGCGAAGCGCGCTGGAGGTTGCGGCGCAGGGGCAGACCCCGGC 

2 5 AGGCGCGGCGCGCGGCAGGGCCGCTTCCTCGCCCGGCAAGCTCGCCTTCC 

TGTTCGCCGGGCAGGGCGCGCAGGT3CCGGGCATGGGCCGTGGGTTGTGG 
GAGGCGTGGCCGGCGTTCCGCGAGACCTTCGACCGGTGCGTCACGCTCTT 
''^ CGACCGGGAGCTCCATCAGCCGCTCTGCGAGGTGATGTGGGCCGAGCCGG 
GCAGCAGCAGGTCGTCGTTGCTGGACCAGACGGCGTTCACCCAGCCGGCG 

3 0 CTCTTTGCGCTGGAGTACGCGCTGGCCGCGCTCTTCCGGTCGTGGGGCGT 
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GGAGCCGGAGCTCGTCGCTGGCCATAGCCTCGGCGAGCTGGTGGCCGCCT 
GCGTGGCGGGTGTGTTCTCCCTCGAGGACGCCGTGCGCTTGGTGGTCGCG 
CGCGGCCGGTTGATGCAGGCGCTGCCGGCCGGCGGCGCGATGGTATCGAT 
CGCCGCGCCGGAGGCCGACGTGGCTGCCGCGGTGGCGCCGCACGCAGCGT 
5 TGGTGTCGATCGCGGCAGTCAATGGGCCGGAGCAGGTGGTGATCGCGGGC 
15 GCCGAGAAATTCGTGCAGCAGATC3CGGCGGCGTTCGCGGCGCGGGGGGC 
GCGAACCAAACCGCTGCATGTCTCGCACGCGTTCCACTCGCCGCTCATGG 
ATCCGATGCTGGAGGCGTTCCGGCGGGTGACTGAGTCGGTGACGTACCGG 
CGGCCTTCGATCGCGCTGGTGAGCAACCTGAGCGGGAAGCCCTGCACCGA 

20 

10 TGAGGTGAGCGCGCCGGGTTACTG3GTGCGTCACGCGCGAGAGGCGGTGC 
GCTTCGCGGACGGAGTGAAGGCGC-GCACGCGGCCGGTGCGGGCCTCTTC 
GTCGAGGTGGGGCCGAAGCCGACGCTGCTCGGCCTTGTGCCGGCCTGCCT 
25 GCCGGATGCCAGGCCGGTGCTGCTCCCAGCGTCGCGCGCCGGGCGTGACG 
AGGCTGCGAGCGCGCTAGAGGCGCTGGGTGGGTTCTGGGTCGTCGGTGGA 

1 5 TCGGTCACCTGGTCGGGTGTCTTCCCTTCGGGCGGACGGCGGGTACCGCT 
GCCAACCTATCCCTGGCAGCGCGAGCGTTACTGGATCGAAGCGCCGGTCG 
3^ ATCGTGAGGCGGACGGCACCGGCCGTGCTCGGGCGGGGGGCCACCCCCTT 
CTGGGTGAAGTCTTTTCCGTGTCGACCCATGCCGGTCTGCGCCTGTGGGA 
GACGACGCTGGACCGAAAGCGGCTGCCGTGGCTCGGCGAGCACCGGGCGC 

20 AGGGGGAGGTCGTGTTTCCTGGCGCCGGGTACCTGGAGATGGCGCTGTCG 

35 

TCGGGGGCCGAGATCTTGGGCGATGGACCGATCCAGGTCACGGATGTGGT 
GCTCATCGAGACGCTGACCTTCGCGGGCGATACGGCGGTACCGGTCCAGG 
TGGTGACGACCGAGGAGCGACCGGGACGGCTGCGGTTCCAGGTAGCGAGT 
^ CGGGAGCCGGGGGAACGTCGCGCGCCCTTCCGGATCCACGCCCGCGGCGT 

2 5 GCTGCGCCGGATCGGGCGCGTCGAGACCCCGGCGAGGTCGAACCTCGCCG 

. CCCTGCGCGCCCGGCTTCATGCCGCCGTGCCCGCTGCGGCTATCTATGGT 
GCGCTCGCCGAGATGGGGCTTCAATACGGCCCGGCGTTGCGGGGGCTCGC 
CGAGCTGTGGCGGGGTGAGGGCGAGGCGCTGGGCAGGGTGAGACTGCCTG 
AGGCCGCCGGCTCCGCGACAGCCTACCAGCTGCATCCGGTGCTGCTGGAC 

3 0 GCGTGCGTCCAAATGATTGTTGGCGCGTTCGCCGATCGCGATGAGGCGAC 
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GCCGTGGGCGCCGGTGGAGGTGGGCTCGGTGCGGCTGTTCCAGCGGTCTC 

CTGGGGAGCTATGGTGCCATGCGCGCGTCGTGAGCGATGGTCAACAGGCC 

TCCAGCCGGTGGAGCGCCGACTTTGAGTTGATGGACGGTACGGGCGCGGT 

GGTCGCCGAGATCTCCCGGCTGGTGGTGGAGCGGCTTGCGAGCGGTGTAC 

GCCGGCGCGACGCAGACGACTGGTTCCTGGAGCTGGATTGGGAGCCCGCG 

GCGCTCGGTGGGCCCAAGATCACAGCCGGCCGGTGGCTGCTGCTCGGCGA 

GGGTGGTGGGCTCGGGCGCTCGTTGTGCTCGGCGCTGAAGGCCGCCGGCC 

ATGTCGTCGTCCACGCCGCGGGGGACGACACGAGCACTGCAGGAATGCGC 

GCGCTCCTGGCCAACGCGTTCGACGGCCAGGCCCCGACGGCCGTGGTGCA 

CCTCAGCAGCCTCGACGGGGGCGGCCAGCTCGGCCCGGGGCTCGGGGCGC 

AGGGCGCGCTCGACGCGCCCCGGAGCCCAGATGTCGATGCCGATGCCCTC 

GAATCGGCGCTGATGCGTGGTTGCGACAGCGTGCTCTCCCTGGTGCAAGC 

GCTGGTCGGCATGGACCTCCGAAACGCGCCGCGGCTGTGGCTCTTGACCC 

GCGGGGCTCAGGCGGCCGCCGCCGGCGATGTCTCCGTGGTGCAAGCGCCG 

CTGTTGGGGCTGC-GCCGCACCATCGCCTTGGAGCACGCCGAGCTGCGCTG 

TATCAGCGTCGACCTCGATCCAGCCGAGCCTGAAGGGGAAGCCGATGCTT 

TGCTGGCCGAGCTACTTGCAGATGATGCCGAGGAGGAGGTCGCGCTGCGC 

GGTGGCGACCGGCTCGTTGCGCGGCTCGTCCACCGGCTGCCCGACGCTCA 

GCGCCGGGAGAAGGTCGAGCCCGCCGGTGACAGGCCGTTCCGGCTAGAGA 

TCGATGAACCCGGCGCGCTGGACCAACTGGTGCTCCGAGCCACGGGGCGG 

CGCGCTCCTGGTCCGGGCGAGGTCGAGATCTCCGTCGAAGCGGCGGGGCT 

CGACTCCATCGACATCCAGCTGGCGTTGGGCGTTGCTCCCAATGATCTGC 

CTGGAGAAGAAATCGAGCCGTTGGTGCTCGGAAGCGAGTGCGCCGGGCGC 

ATCGTCGCTGTGGGCGAGGGCGTGAACGGCCTTGTGGTGGGCCAGCCGGT 

GATCGCCCTTGCGGCGGGAGTATTTGCTACCCATGTCACCACGTCGGCCA 

CGCTGGTGTTGCCTCGGCCTCTGGGGCTCTCGGCGACCGAGGCGGCCGCG 

ATGCCCCTCGCGTATTTGACGGCCTGGTACGCCCTCGACAAGGTCGCCCA 

CCT3CAGGCGGGGGAGCGGGTGCTGATCCATGCGGAGGCCGGTGGTGTCG 

GTCTTTGCGCGGTGCGATGGGCGCAGCGCGTGGGCGCCGAGGTGTATGCG 

ACCGCCGACACGCCCGAGAACCGTGCCTACCTGGAGTCGCTGGGCGTGCG 
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GTACGTGAGCGATTCCCGCTCGGGCCGGTTCGTCACAGACGTGCATGCAT 
GGACGGACGGCGAGGGTGTGGACGTCGTGCTCGACTCGCTTTCGGGCGAG 
CGCATCGACAAGAGCCTCATGGTCCTGCGCGCCTGTGGTCGCCTTGTGAA 
GCTGGGCAGGCGCGACGACTGCGCCGACACGCAGCCTGGGCTGCCGCCGC 
5 TCCTACGGAATTTTTCCTTCTCGCAGGTGGACTTGCGGGGAATGATGCTC 
^5 GATCAACCGGCGAGGATCCGTGCGCTCCTCGACGAGCTGTTCGGGTTGGT 
CGCAGCCGGTGCCATCAGCCCACTGGGGTCGGGGTTGCGCGTTGGCGGAT 
CCCTCACGCCACCGCCGGTCGAGACCTTCCCGATCTCTCGCGCAGCCGAG 
GCATTCCGGAGGATGGCGCAAGGACAGCATCTCGGGAAGCTCGTGCTCAC 

20 

1 0 GCTGGACGACCCGGAGGTGCGGATCCGCGCTCCGGCCGAATCCAGCGTCG 
CCGTCCGCGCGGACGGCACCTACCTTGTGACCGGCGGTCTGGGTGGCCTC 
GGTCTGCGCGTGGCCGGATGGCTGGCCGAGCGGGGCGCGGGGCAACTGGT 
25 GCTGGTGGGCCGCTCCGGTGCGGCGAGCGCAGAGCAGCGAGCCGCCGTGG 
CGGCGCTGGAGGCCCACGGCGCGCGCGTCACGGTGGCGAAAGCGGACGTC 
1 5 GCCGATCGGTCACAGATCGAGCGGGTCCTCCGCGAGGTTACCGCGTCGGG 
GATGCCGCTGCGGGGTGTCGTGCATGCGGCAGGTCTCGTGGATGACGGGC 
3^ TGCTGATGCAGCAGACTCCGGCGCGGTTCCGCACGGTGATGGGACCTAAG 
GTCCAGGGGGCCTTGCACTTGCACACGCTGACACGCGAAGCGCCTCTTTC 
CTTCTTCGTGCTGTACGCTTCTGCAGCTGGGCTTTTCGGCTCGCCAGGCC: 
2 0 AGGGCAACTATGCCGCAGCCAACGCGTTCCTCGACGCCCTTTCGCATCAC 
CGAAGGGCGCAGGGCCTGCCGGCGCTGAGCATCGACTGGGGCATGTTCAC 
GGAGGTGGGGATGGCCGTTGCGCAAGAAAACCGTGGCGCGCGGCAGATCT 
CTCGCGGGATGCGGGGCATCACCCCCGATGAGGGTCTGTCAGCTCTGGCG 
40 CGCTTGCTCGAGGGTGATCGCGTGCAGACGGGGGTGATACCGATCACTCC 

2 5 GCGGCAGTGGGTGGAGTTCTACCCGGCAACAGCGGCCTCACGGAGGTTGT 

CGCGGCTGGTGACCACGCAGCGCGCGGTCGCTGATCGGACCGCCGGGGAT 
CGGGACCTGCTCGAACAGCTTGCGTCGGCTGAGCCGAGCGCGCGGGCGGG 
GCTGCTGCAGGACGTCGTGCGCGTGCAGGTCTCGCATGTGCTGCGTCTCC 
CTGAAGACAAGATCGAGGTGGATGCCCCGCTCTCGAGCATGGGCATGGAC 

3 0 TCGCTGATGAGCCTGGAGCTGCGCAACCGCATCGAGGCTGCGCTGGGCGT 

50 
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CGCCGCGCCTGCAGCCTTGGGGTGGACGTACCCAACGGTAGCAGCGATAA 
C^GCGCTGGCTGCTCGACGACGCCCTCGTCGTCCGGCTTGGCGGCC-GGTCG 
GACACGGACGAATCGACGGCGAGCGCCGGTTCGTTCGTCCACGTCCTCCG 
CTTTCGTCCTGTCGTCAAGCCGCGGGCTCGTCTCTTCTGTTTTCACGGTT 
5 CTGGCGGCTCGCCCGAGGGCTTCCGTTCCTGGTCGGAGAAGTCTGAGTGG 
^5 AGCGATCTGGAAATCGTGGCCATGTGGCACGATCGCAGCCTCGCCTCCGA 
GGACGCGCCTGGTAAGAAGTACGTCCAAGAGGCGGCCTCGCTGATTCAGC 
ACTATGCAGACGCACCGTTTGCGTTAGTAGGGTTCAGCCTGGGTGTCCGG 
TTCGTCATGGGGACAGCCGTGGAGCTCGCCAGTCGTTCCGGCGCACCGGC 

20 

IC TCCGCTGGCCGTCTTCACGTTGGGCGGCAGCTTGATCTCTTCTTCAGAGA 
TCACCCCGGAGATGGAGACCGATATAATAGCCAAGCTCTTCTTCCGAAAT 
GCCGCGGGTTTCGTGCGATCCACCCAACAAGTCCAGGCCGATGCTCGCGC 
25 AGACAAGGTCATCACAGACACCATGGTGGCTCCGGCCCCCGGGGACTCGA 
AGGAGCCGCCCGTGAAGATCGCGGTCCCTATCGTCGCCATCGCCGGCTCG 

1 5 GACGATGTGATCGTGCCTCCGAGCGACGTTCAGGATCTACAATCTCGCAC 
CACGGAGCGCTTCTATATGCATCTCCTTCCCGGAGATCACGAATTTCTCG 
TCGATCGAGGGCGCGAGATCATGCACATCGTCGACTCGCATCTCAATCCG 
CTGCTCGCCGCGAGGACGACGTCGTCAGGCCCCGCGTTCGAGGCAAAATG 
ATGGCAGCCTCCCTCGGGCGCGCGAGATGGTTGGGAGCAGCGTGGGCGCT 

20 GGCGGCCGGCGGCAGGCCGCGGAGGCGCATGAGCCTTCCTGGACGTTTGC 

35 

AGTATAGGAGATTTTATGACACAGGAGCAAGCGAATCAGAGTGAGACGAA 
GCCTGCTTTCGACTTCAAGCCGTTCGCGCCTGGGTACGCGGAGGACCCGT 
TCCCCGCGATCGAGCGCCTGAGAGAGGCAACCCCCATCTTCTACTGGGAT 
40 GAAGGCCGCTCCTGGGTCCTCACCCGATACCACGACGTGTCGGCGGTGTT 

2 5 CCGCGACGAACGCTTCGCGGTCAGTCGAGAAGAGTGGGAATCGAGCGCGG 

AGTACTCGTCGGCCATTCCCGAGCTCAGCGATATGAAGAAGTACGGATTG 
TTCGGGCTGCCGCCGGAGGATCACGCTCGGGTCCGCAAGCTCGTCAACCC 
GTCGTTTACGTCACGCGCCATCGACCTGCTGCGCGCCGAAATACAGCGCA 
CCGTCGACCAGCTGCTCGATGCTCGCTCCGGACAAGAGGAGTTCGACGTT 

3 0 GTGCGGGATTACGCGGAGGGAATCCCGATGCGCGCGATCAGCGCTCTGTT 
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GAAGGTTCCGGCCGAGTGTGACGAGAAGTTCCGTCGCTTCGGCTCGGCGA 
CTTGCGCGCGCGCTCGGCGTGGGTTTGGTGCCCCAGGTCGATGAGGAGACC 
AAGACCCTGGTCGCGTCCGTCACCGAGGGGCTCGCGCTGCTCCATGACGT 
CCTCGATGAGCGGCGCAGGAACCCGCTCGAAAATGACGTCTTGACGATGC 
5 TGCTTCAGGCCGAGGCCGACGGCAGCAGGCTGAGCACGAAGGAGCTGGTC 
15 GCGCTCGTGGGTGCGATTATCGCTGCTGGCACCGATACCACGATCTACCT 
TATCGCGTTCGCTGTGCTCAACCTGCTGCGGTCGCCCGAGGCGCTCGAGC 
TGGTGAAGGCCGAGCCCGGGCTCATGAGGAACGCGCTCGATGAGGTGCTC 
CGCTTCGACAATATCCTCAGAATAGGAACTGTGCGTTTCGCCAGGCAGGA 

20 

1 0 CCTGGAGTACTGCGGGGCATCGATCAAGAAAGGGGAGATGGTCTTTCTCC 
TGATCCCGAGCGCCCTGAGAGATGGGACTGTATTCTCCAGGCCAGACGTG 
TTTGATGTGCGACGGGACACGGGCGCGAGCCTCGCGTACGGTAGAGGCCC 
25 CCATGTCTGCCCCGGGGTGTCCCTTGCTCGCCTCGAGGCGGAGATCGCCG 
TGGGCACCATCTTCCGTAGGTTCCCCGAGATGAAGCTGAAAGAAACTCCC 
i 5 GTGTTTGGATACCACCCCGCGTTCCGGAACATCGAATCACTCAACGTC AT 
CTTGAAGCCCTCCAAAGCTGGATAGCTCGCGGGGGTATCGCTTCCCGAAC 
^0 CTCATTCCCTCATGATACAGCTCGCGCGCGGGTGCTGTCTGCCGCGGGTG 
CGATTCGATCCAGCGGACAAGCCCATTGTCAGCGCGCGAAGATCGAATCC 
ACGGCCCGGAGAAGAGCCCGTCCGGGTGACGTCGGAAGAAGTGCCGGGCG 
20 CCGCCCTGGGAGCGCAAAGCTCGCTCGTTCGCGCTCAGCACGCCGCTCGT 

35 

CATGTCCGGCCCTGCACCCGCGCCGAGGAGCCGCCCGCCCTGATGCACGG 
CCTCACCGAGCGGCAGGTTCTGCTCTCGCTCGTCGCCCTCGCGCTCGTCC 
TCCTGACCGCGCGCGCCTTCGGCGAGCTCGCGCGGCGGCTGCGCCAGCCC 
40 GAGGTGCTCGGCGAGCTCTTCGGCGGCGTGGTGCTGGGCCCGTCCGTCGT 

25 CGGCGCGCTCGCTCCTGGGTTCCATCGAGTCCTCTTCCAGGATCCGGCGG 
TCGGGGTCGTGCTCTCCGGCATCTCCTGGATAGGCGCGCTCGTCCTGCTG 
CTCATGGCGGGTATCGAGGTCGATGTGAGCATCCTGCGCAAGGAGGCGCG 
CCCCGGGGCGCTCTCGGCGCTCGGCGCGATCGCGCCCCCGCTGCGCACGC 
CGGGGCCGCTGGTGCAGCGCATGCAGGGCGCGTTCACGTGGGATCTCGAC 

3 0 GTCTCGCCGCGACGCTCTGCGCAAG CCTGAGCCTCGGCGCCTGCTCGTAC 
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ACCTCGCCGGTGCTCGCTCCGCCCGCGGACATCCGGCCGCCCGCCGCGGC 
^0 CCAGCTCGAGCCGGACTCGCCGGATGACGAGGCCGACGAGGCCGACGAGG 
CGCTCCGCCCGTTCCGCGACGCGATCGCCGCGTACTCGGAGGCCGTTCGG 
TGGGCGGAGGCGGCGCAGCGGCCGCGGCTGGAGAGCCTCGTGCGGCTCGC 
5 GATCGTGCGGCTGGGCAAGGCGCTCGACAAGGTCCCTTTCGCGCACACGA 
^5 CGGCCGGCGTCTCCCAGATCGCCGGCAGACTCCAGAACGATGCGGTCTGG 
TTCGATGTCGCCGCCCGGTACGCGAGCTTCCGCGCGGCGACGGAGCACGC 
GCTCCGCGACGCGGCGTCGGCCATGGAGGCGCTCGCGGCCGGCCCGTACC 
GCGGATCGAGCCGCGTGTCCGCTGCCGTAGGGGAGTTTCGGGGGGAGGCG 

20 

10 GCGCGCCTTCACCCCGCGGACCGTGTACCCGCGTCCGACCAGCAGATCCT 
GACCGCGCTGCGCGCAGCCGAGCGGGCGCTCATCGCGCTCTACACTGCGT 
TCGCCCGTGAGGAGTGAGCCTCTCTCGGGCGCAGCCGAGCGGCGGCGTGC 
25 CGGTGGTTCCCTCTTCGCAACCATGACCGGAGCCGCGCTCGGTCCGCGCA 
GCGGCTAGCGCGCGTCGCGGCAGAGATCGCTGGAGCGACAGGCGACGACC 
1 5 CGCCCGAGGGTGTCGAACGGATTGCCGCAGCCCTCATTGCGGATCCCCTC 
CAGACACTCGTTCAGCTGCTTGGCGTCGATGCCGCCTGGGCACTCGCCGA 
2^ AGGTCAGCTCGTCGCGCCACTCGGATCGGATCTTGTTCGAGCACGCGTCC 
TTGCTCGAATACTCCCGGTCTTGTCCGATGTTGTTGCACCGCGCCTCGCG 
GTCGCACCGCGCCGCCACGATGCTATCGACGGCGCTGCCGACTGGCACCG 
20 GCGCCTCGCCCTGCGCGCCACCCGGGGTTTGCGCCTCCCCGCCTGACCGC 

35 

TTTTCGCCGCCGCACGCCGCGAGCAGGCTCATTCCCGACACCGAGATCAG 
GCCCACGACCAGCTTCCCAGCAATCTTTTGCATGGCTTCCCCTCCCTCAC 
GACACGTCACATCAGAGACTCTCCGCTCGGCTCGTCGGTTCGACAGCCGG 
^ CGACGGCCACGAGCAGAACCGTCCCCGACCAGAACAGCCGCATGCGGGTT 

25 TCTCGCAACATGCCCCGACATCCTTGCGACTAGCGTGCCTCCGCTCGTGC 
CGAGATCGGCTGTCCTGTGCGACGGCAATATCCTGCGATCGGCCGGGCAG 
GAGGTACCGACACGGGCGCCGGGCGGGAGGTGCCGCCACGGGCTCGAAAT 
GTGCTGCGGCAGGCGCCTCCATGCCCGCAGCCGC-GAACGCGGCGCCCGGC 
CAGCCTCGGGGTGACGCCGCAAACGGGAGATGCTCCCGGAGAGGCGCCGG 

5 0 GCACAGCCGAGCGCCGTCACCACCGTGCGCACTCGTGAGCTCCAGCTCCT 
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CGGCATAGAAGAGACCGTCACTCCCGGTCCGTGTAGGCGATCGTGCTGAT 
CAGCGCGTTCTCCGCCTGACGCGAGTCGAGCCGGGTATGCTGCACGACAA 
TGGGAACGTCCGATTCGATCACGCTGGCATAGTCCGTATCGCGCGGGATC 
GGCTCGGGTTCGGTCAGATCGTTGAACCGGACGTGCCGGGTGCGCCTCGC 
TGGGACGGTCACCCGGTACGGCCCGGCGGGGTCGCGGTCGCTGAAGTACA 
CGGTGATGGCGACCTGCGCGTCCCGCTCCGACGCATTCAACAGGCAGGCC 
GTCTCATGGCTCGTCATCTGCGGCTCGGGTCCGTTGCTCCGGCCTGGGAT 
GTAGCCCTCTGCGATTGCCCAGCGCGTCCGCCCGATCGGCTTCTCCATAT 
GTCCTCCCTGCTGGCTCCTCTTTGGCTGCCTCCCTCTGCTGTCCAGGAGC 
GACGGCCTCTTCTCCCGACGCGCTCGGGGATCCATGGCTGAGGATCCTCG 
CCGAGCGCTCCTTGCCGACCGGCGCGCCGAGCGCCGACGGGCTTTGAAAG 
CACGCGACCGGACACGTGATGCCGGCGCGACGAGGCCGCCCCGCGTCTGA 
TCCCGATCGTGACATCGCGACGTCCGCCGGCGCCTCTGCAGGCCGGCCTG 
AGCGTTGCGCGGTCATGGTCGTCCTCGCGTCACCGCCACCCGCCGATTCA 
CATCCCACCGCGGCACGACGCTTGCTCAAACCGCGGCGAGACGGCCGGGC 
GGCTGTGGTACCGGCCAGCCCGGACGCGAGGCCCGAGAGGGACAGTGGGT 
CCGCCGTGAAGCAGTGAGGCGATCGAGGTGGCAGATGAAACACGTTGACA 
CGGGCCGACGAGTCGGCCGCCGGATAGGGCTCACGCTCGGTCTCCTCGCG 
AGCATGGCGCTCGCCGGCTGTGGCGGCCCGAGCGAGAAAATCGTGCAGGG 
CACGCGGCTCGCGCCCGGCGCCGATGCGCACGTCGCCGCCGACGTCGACC 
CCGACGCCGCGACCACGCGGCTGGCGGTGGACGTCGTTCACCTCTCGCCG 
CCCGAGCGCATCGAGGCCGGCAGCGAGCGGTTCGTCGTCTGGCAGCGTCC 
GAGCTCCGAGTCCCCGTGGCAACGGGTCGGAGTGCTCGACTACAACGCTG 
CCAGCCGAAGAGGCAAGCTGGCCGAGACGACCGTGCCGCATGCCAACTTC 
GAGCTGCTCATCACCGTCGAGAAGCAGAGCAGCCCTCAGTCTCCATCTTC 
TGCCGCCGTCATCGGGCCGACGTCCGTCGGGTAACATCGCGCTATCAGCA 
GCGCTGAGCCCGCCAGCAGGCCCCAGAGCCCTGCCTCGATCGCCTTCTCC 
ATCATATCATCCCTGCGTACTCCTCCAGCGACGGCCGCGTCGAAGCAACC 
GCCGTGCCGGCGCGGCTCTACGTGCGCGACAGGAGAGCGTCCTGGCGCGG 
CCTGCGCATCGCTGGAAGGATCGGCGGAGCATGGAGAAAGAATCGAGGAT 



wo 00/22139 



178 



CGCGATCTACGGCGCCATCGCAGCCAACGTGGCC3ATCGCGGCGGTCAAGT 

TCATCGCCGCCGCCGTGACCGGCAGCTCGGCGATGCTCTCCGAGGGCGTG 

CACTCCCTCGTCGATACTGCAGACGGGCTCCTCCTCCTGCTCGGCAAGCA 

CCGGAGCGCACGCCCGCCCGACGCCGAGCATCCGTTCGGCCACGGCAAGG 

AGCTCTATTTCTGGACGCTGATCGTCGCCATCATGATCTTCGCCGCGGGC 

GGCGGCGTCTCGATCTACGAAGGGATCTTGCACCTCTTGCACCCGCGCCA 

GATCGAGGATCCGACGTGGAACTACGTCGTCCTCGGCGCAGCGGCCGTCT 

TCGAGGGGACGTCGCTCATCATCTCGATCCACGAGTTCAAGAAGAAGGAC 

GGACAGGGCTACCTCGCGGCGATGCGGTCCAGCAAGGACCCGACGACGTT 

CACGATCGTCCTGGAGGACTCCGCGGCGCTCGCCGGGCTCACCATCGCCT 

TCCTCGGCGTCTGGCTCGGGCACCGCCTGGGAAACCCCTACCTCGACGGC 

GCGGCGTCGATCGGCATCGGCCTCGTGCTCGCCGCGGTCGCGGTCTTCCT 

CGCCAGCCAGAGCCGTGGGCTCCTCGTGGGGGAGAGCGCGGACAGGGAGC 

TCCTCGCCGCGATCCGCGCGCTCGCCAGCGCAGATCCTGGCGTGTCGGCG 

GTGGGGCGGCCCCTGACGATGCACTTCGGTCCGCACGAAGTCCTGGTCGT 

GCTGCGCATCGAGTTCGACGCCGCGCTCACGGCGTCCGGGGTCGCGGAGG 

CGAGGGAGCGCATCGAGACCCGGATACGGAGCGAGCGACCCGACGTGAAG 

CACATCTACGTCGAGGCCAGGTCGCTCCACCAGCGCGCGAGGGCGTGACG 

CGCCGTGGAGAGACCGCGCGCGGCCTCCGCCATCCTCCGCGGCGCCCGGG 

CTCAGGTGGCCCTCGCAGCAGGGCGCGCCTGGCGGGCAAACCGTGCAGAC 

GTCGTCCTTCGACGCGAGGTACGCTGGTTGCAAGTCGTCACGCCGTATCG 

CGAGGTCCGGCAGCGCCGGAGCCCGGGCGGGCCGGGCGCACGAAGGCGCG 

GCGAGCGCAGGCTTCGAGGGGGGCGACGTCATGAGGAAGGCCAGGGCGCA 

TGGGGCGATGCTCGGCGGGCGAGATGACGGCTGGCGTCGCGGCCTCCCCG 

GCGCCGGCGCGCTTCGCGCCGCGCrCCAGCGCGGTCGCTCGCGCGATCTC 

GCCCGGCGCCGGCTCATCGCCTCCGTGTCCCTCGCCGGCGGCGCCAGCAT 

GGCGGTCGTCTCGCTGTTCCAGCrCGGGATCATCGAGCGCCTGCCCGATC 

CTCCGCTTCCAGGGTTCGATTCGGCCAAGGTGACGAGCTCCGATATCGCG 

TTCGGGCTCACGATGCCGGACGCGCCGCTCGCGCTCACCAGCTTCGCGTC 

CAACCTCGCGCTGGCTGGCTGGGGAGGCGCCGAGCGCGCCAGGAACACCC 
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CCTGC-ATCCCCGTCGCCGTGGCGGCCAAGGCGGCCGTCGAGGCGGCCGTG 

TCCGGATGGCTCCTCGTCCAGATGCGACGGCGGGAGAGGGCCTGGTGCGC 

CTACTGCCTGGTCGCCATGGCGGCCAACATGGCCGTGTTCGCGCTCTCGC 

TCCCGGAAGGGTGGGCGGCGCTGGGGAAGGCGCGAGCGCGCTCGTGACAG 

GACGGGCGCGGGCAGCCCCGGCCATCGGAGGCCGGCGTGCACCCGCTCCG 

TCACGCCCCAGCCCGCGCCGCGTGATCTCCCGCGGACAC-GGCGCGTACCG 

TGGACCCCGCACGCGCCGCGTCGACGGACATCCCCGGCGACCCGCGCGGC 

GCGACCCGCGCAACTCCGGCCCGCCGCCGGGCATCGACATCTCCCGTGAG 

CAAGGGCACTCCGCTCCTGCCCGCGTCCGCGAACGATGGCTGCGCTGTTT 

CCACCCTGGAGCAACTCCGTTTACCGCGTGGCGCTCGTCGGGCTCGTCGC 

CTCGGCGGGCGGCGCCATCCTCGCGCTCATGATCTACGTCCGCACGCCGT 

GGAAGCGATACCAGTTCGAGCCCGTCGATCAGCCGGTGCAGTTCGATCAC 

CGCCATCACGTGCAGGACGACGCCATCGATTGCGTCTACTGCCACACCAC 

GGTGACCCGCTCGCCCACGGCGGGGATGCCGCCGACGGCCACGTGCATGG 

GGTGCCACAGCCAGATCTGGAATCAGAGCGTCATGCTCGAGCCCGTGCGG 

CGGAGCTGGTTCTCCGGCCACGCCGATCCCGTGGAACCGGGTGAAACTCC 

GTGCCCGACTTCGTCTATTTCAACCACGCGATCCACGTGAACAAGGGCGT 

GGGCTGGCGTGAAGCTGCCACGGGCGCGTGGACGAGATGGCGGCCGTCTA 

CAAGGTGGCGCCGATGACGATGGGCTGGTGCCTGGAGTGCCATCGCCTGC 

CGGAGCCGCACCTCCGCCCGCTCTCCGCGATCACCGACATGCGCTGGGAC 

CCC-GGGGAGCGGAGGGATGAGCTCGGGGCGCAGCTCGCGAAGGAATACGG 

GGTCCGGCGGCTCACGCACTGCACAGCGTGCCATCGATGAACGATGAACA 

GGGGATCTCCTTGAAAGACGCAGATGAGATGAAGGAATGGTGGCTAGAAG 

CGCTCGGGCCGGCGGGAGAGCGCGCGTCCTACAGGCTGCTGGCGCCGCTC 

ATCGAGAGCCCGGAGCTCCGCGCGCTCGCCGCGGGCGAACCGCCCCGGGG 

CGTGGACGAGCCGGCGGGCGTCAGCCGCCGCGCGCTGCTCAAGCTGCTCG 

GCGCGAGCATGGCGCTCGCCGGCGTCGCGGGCTGCACCCCGCATGAGCCC 

GAGAAGATCCTGCCGTACAACGAGACCCCGCCCGGCGTCGTGCCGGGTCT 

CTCCCAATCCTACGCGACGAGCA7GGTGCTCGACGGGTATGCCATGGGCC 

TCCTCGCCAAGAGCTACGCGGGGCGGCCCATCAAGATCGAGGGCAACCCC 
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GCGCACCCGGCGAGCCTCGGCGCGACCGGCGTCCACGAGCAGGCCTCGAT 

CCTCTCGCTGTACGACCCGTACCGCGCGCGCGCGCCGACGCGCGGCGGCC 

AGGTCGCGTCGTGGGAGGCGCTCTCCGCGCGCTTCGGCCGCGACCGCGAG 

GACGGCGGCGCTGGCCTCCGCTTCGTCCTCCAGCCCACGAGCTCGCCCCT 

CATCGCCGCGCTGATCGAGCGCGTCCGGCGCAGGTTCCCCGGCGCGCGGT 

TCACCTTCTGGTCGCCGGTCCACGCCGAGCACGCGCTCGAAGGCGCGCGG 

GCGGCGCTCGGCCTCAGGCTCTTGCCTCAGCTCGACGTCGACCAGGCCGA 

GGTGATCCTCTCGCTGGACGCGGACTTCCTCGCGGACATGCCGTTCAGCG 

TGCGCTATGCGCGCGACTTCGCCGCGCGCCGCCGCCCCGCGAGCCCGGCG 

GCGGCCATGAGCCGCCTCTACGTCGCGGAGGCGATGTTCACGCCCACGGG 

GACGCTCGCCGACCACCGGCTCCGCGTGCGGCCCGCCGAGGTCGCGCGCG 

TCGCGGCCGGCGTCGCGGTGGGAACTCGTGCACGAGTCTTTGTCTTGCGC 

CCTGTCCGGGAATAACGGACACCTTATCGCGGGTCGCTCTTTGTGCGCGG 

CTTCTGTACCTCTCAGGACAGGTAGAAGAGGGACTCAGGGGCCCTTATGT 

TAACTGGGGATGCCTTCGGGACGGCCGCAAATATATCCTATCACCTCACT 

GGGTGTGGGGGAGCACCGCGAGGATGTACAACCTCTGTAACTCTATGTGA 

GATAATGTGTGCAGTGATCTGAGACTTATTTGTGTGACCGAGACGTCTCT 

CTTATTGGTACGCATAGTATAATATAACACGTCTCATACATACTCCCGAC 

ATATCCGCGGTATGCGCGCACATAGAATAGGTGATGATAAATCCCTAGTG 

TGTGGAACTAGAAGATGCGGGAGTTACCTGATATTTACGGAAAAAGTATT 

ATCTCAACTACCTCTCTGTTGAGACTATCACTTCGGTGTCGTTGTGCTGC 
TGGT, 

or its complementary strand. 



(b) DNA-sequences which hybridise under stringent conditions 
to regions of DNA-sequences according to (a) encoding proteins 
or to fragments of said DNA-sequences, 
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(c) DNA-sequences which hybridise to the DNA-sequences accord- 
ing to (a) and (b) because of a degeneration of the. genetic 
code, 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isof unctional expression products. 

11- Peptide encoded by a DNA sequence according to claim 10 
selected from the group consisting of 

Seq ID No 83 

>Contig56_003 2890 amino acids MW=307428 D pl=5.76 numambig=13 

IRPRAAAVPMRSTVTGGVIAGPELGASYWADNLRQPVRFAAAAQALLEGGPALFIEMSPH 

PILVPPLDEIQTAAEQGGAAVGSLRRGQDERATLLEALGTLWASGYPVSWARLFPAGGRR 

VPLPTYPWQHERCWIEVEPDARRLAAADPTKDWFYRTDWPEVPRAAPKSETAHGSWLLLA 

DRGGVGEAVAAALSTRGLSCTVLHASADASTVAEQVSEAASRRNDWQGVLYLWGLDAWD 

AGASADEVSEATRRATAPVLGLVRFLSAAPHPPRFWWTRGACTVGGEPEASLCQAALWG 

LARVAALEHPAAWGGLVDLDPQKSPTEIEPLVAELLSPDAEDQLAFRSGRRHAARLVAAP 

PEGDVAPISLSAEGSYLVTGGLGGLGLLVARWLVERGARHLVLTSRHGLPERQASGGEQP 

PEARARIAAVEGLEAQGARVTVAAVDVAEADPMTALLAAIEPPLRGVVHAAGVFPVRHLA 

ETDEALLESVLRPKVAGSWLLHRLLRDRPLDLFVLFSSGAAVWGGKGQGAYAAANAFLDG 

LAHHRRAHSLPALSUVWGLWAEGGMVDAKAHARLSDIGVLPMATGPALSALERLVNTSAV 

QRSVTRMDWARFAPVYAARGRRNLLSALVAEDERAASPPVPTANRIWRGLSVAESRSALY 

ELVRGIVARVLGFSDPGALDVGRGFAEQGLDSLMALEIRNRLQRELGERLSATLAFDHPT 

VERLVAHLLTDVLKIiEDRSDTRHIRSVAADDDIAIVGAACRFPGGDEGLETYWRHLAEGM 

WSTEVPADRWRAADWYDPDPE VFGRTYVAKGAFLRDVRSLDAAFFAI S PREAMSLDPQQ 

RLLLEVSWEAIERAGQDPMALRESATGVFVGMIGSEHAERVQGLDDDAALLYGTTGNLLS 

VAAGRLSFFLGLHGPTMTVDTACSSSLVALHLACQSLRLGECDQALAGGSSVLLSPRSFV 
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AASRMRLLSPDGRCXTFSAAADGFARAEGCAVWLKRLRDAQRDRDPII^VVRSTAINHD 
GPSSGLTVPSGPAQQALLRQALAQAGVAPAEVDFVECHGTGTALGDPIEVQALGAVYGRG 
RPAERPLWLGAVKANLGHLEAAAGLAGVLKVLLALEHEQX PAQPELDELNPHI PWAELPV 
AWRRAVPWPRGARPRRAGVSAFGLSGTNAHWLEEAPAVEPVAAAPERAAELFVLSAKS 
AAALDAQAARLRDHLEKHVELGLGDVAFSLATTRSAMEHRLAVAASSREALRGALSAAAQ 
GHT?PGAVRGRASGGSAPlCWFVFPGQGSQWVGMGRKL^4AEEPVFRAALEGCDRAIEAEA 
GWSLLGELSADEAASQLGRIDWQPVXFAMEVALSALVJRSWGVEPEAWGHSMGEVAAAH 
VAGALSLEDAVAI ICRRSRLLRR I SGQGEMALVELSLEEAEAALRGHEGRLSVAVSNS PR 
STVl^GEPAALSEVIAALTAKGVFWROVKVbVASHSPQVDPLREHLIAALGAIRFRAAAV 
PMRSTVTGGVIAGPELGASYWADNLRQPVRFAAAAQALLEGGPALFIEMSPHPILVPPLD 
EIQTAAEQGGAAVGSLRRGQDERATLLEALGTLWASGYPVSWARLFPAGGRRVPLPTYPW 
QHERYWIEDSVHGSKPSLRLRQLRNGATDHPLLGAPLLVSARPGAHLWEQALSDERLSYL 
S EHRVHGEAVLP SAAYVEMALAAGTOL YGTATL VLEQLALERALAVP S EGGR I VQVALS E 
EGPGRASFQVSSREEAGRSWVRHATGHVCSGQSSAVGALKEAPWEIQRRCPSVLSSEALY 
PLLNEHALDYGPCFQGVEQVWLGTGEVLGRVRLPGDMASSSGAYRIHPALLDACFQVLTA 
LLTTPESIEIRRRLTDLHEPDLPRSRAPVNQAVSDTWLWDAALDGGRRQSASVPVDLVLG 

sfhakwevmeriaqayiigtlriv;nvfcaagerhtidellvrlqisvvyrkvikrwmehl 
vatgilvgdgehfvssqplpepdlaavleeagrvfadlpvlfewckfagerladvltgkt 

LALE I LFPGGSFDMAER I YRDS P I ARYSNGI VRG WESAARWAPSGMFS I LE I GAGTGA 
TTAAVLPVLLPDRTEYHFTDVSPIrLARAEQRFRDYPFLKYGILDVDQEPAGQGYAHQRF 
DVIVAANVIHATRDIRATAKRLLSLLAPGGLLVLVEGTGHPIWFDITTGLIEGWQKYEDD 
LRIDHPLLPARTWCDVLRRVGFACAVSLPGDGSPAGILGQHVILSRAPGIAGAACDSSGE 
SATESPAARAVRQEV?ADGSADVVHRMALERMYFHRRPGRQVWVHGRLRTGGGAFTKALAG 
DLLLFEDTGQWAEVQGLRLPQLEASAFAPRDPREEWLYALEWQRKDPTPEAPAAASSSS 
AGAWLVLMDQGGT6AALVSLLEGR3EACVRVIAGTAYACLAPGLYQVDPAQPDGFHTLLR 
DAFGEDRICRA\AmMWSLDATAAGERATAESLQADQLLGSLSALSLVQALVRRRWRNMPR 
LWLLTRAVHAVGAEDAAASVAQAFVWGLGRTLALEHPELRCTLVDVNPAPSPEDAAALAV 
ELGASDREDQVALRSDGRYVARL\^SSFSGKPATDCGIRADGSYVITDGMGRVGLSVAOW 
MVMQGARHWLVDRGGASEASRDALRSMAEAGAEVQI VEADVARRDDVARLLS KI SPSMP 
PLRGIVYVDGTFQGDSSMLELDARRFKEWMYPICVLGAWNLHALTRDRSLDFFVLYSSGTS 
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LLGLPGQGSRAAGDAFLDAIAHHRCKVGLTAMSINWGLLSEASSPATPNDGGARLEYRGM 

SGLTLEQGAAALGRLLARPRAQVGVMRLNLRQWLEXXXXXXXXXXXXXWYNLLI I IQYTK 
VPFQGPFRML* 

Seq ID No 84 

>Contig56_027 700 amino acids MW=80569 D pl=7.02 numarabig=0 

MNMELNITSKSNPFGDTTAENDKKMLSNAPIETADFRTLIETDDRTIWGRRGTGKSALF 

IQLNEHWKKDKKILILSFSPDDSQIIGFRSMLKPFTGSFNIJUIAATRLLWRYAMLMEIAS 

YISSHYKLSSQISSETLLNEHLKKWNSAQGDILRKCRLVAKEYLDENNPEESIGDLQFNL 

NISEIENNIVSLLERSDRKWILMDKLDEAYEPDNIGIGIIAGLAYASIELNQKAKCIRP 

IIFLRDNIFRSLSKEDPDYSRNIEGQVIRLHWDWAQLLMLSAKRMKVAFKLDIEKDQRVW 

DRCTADDLKGRNGFKRCLQFTLYRPRDLLSLLNEAFFSAFRENRETI INTDLEYAAKSI S 

MARLEDLWKEYQKIFPSIQVITSAFRSIEPELTVYTCLKKIEASFELIEENGDPKITSEI 

QU^KASGILQSLYSVGFVGIRDKNTSSYSFCHDGRTPDKGFESNEKLLIHPCYWLGLNLN 

RNALAPEEAEEI NDE YD IN 1 1 SDNSAI RNKTIGQ ITTHLDQI P I GNEGATEFEQWCLDAL 

RIVFASHLTDIKSHPNGNAVQRRDIIGTNGGKSDFWKRVLEDYKTRQWFDAKNFEELGP 

SE YRQLQS YLTG P YGKLGF 1 1 NRDESE VLKSGKDLDWTKEMYQ SHNS L 1 1 KLP AKY I S KL 

LQKLRNPEKHDAIDRQMGiCLLTLYETSYMAIKSTQKKRRK* 

Seq ID No 85 

>Cont:ig57_001 372 amino acids MW=38411 D pl = 12.39 numambig=10 

MLTSXXXXXXXXXXLLAYRCATAARGAGRIRHHRQYAQRRRRTWLYARAAGTLPDRRHG 

LLWARGHPPCLPQPARSRERSRAGGRRQHAPFVQDDDNAGAHPGAVARWPLPDIRRLGQ 

RVRPWGGLRYGRAOTALRRPATRRSDLGSDPGFGHESGWPVDRVDGTQCARSGGALARGA 

AERSRRRRGHRLCRDPRNGDLARRPDRGRGAACRVGAGAGRWEPLRAGRSEDKPRPPGGR 

CRRGGFDQGGAGSAPRTDPAKPFFPHAQSADPDRGDRARAGDGAGAVAAGGPTALRGGER 

VRFQRHQRPCRAGGGAGHGARTGDAGALSGAFGAVGEERRRAGRTGGAALSAHRRVPGAG 

SRRRRVQPGIDA* 
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Seq ID No 86 

>Contig57_002 2259 amino acids MW=238258 D pI-5.92 numambig=0 

MSYTLGLQGPCLTVDTVCSSSLVAIHLACRSLRARESDLALAGGVNMLLSSKTMIMLGRI 

Q/U.SPDGHCRTFDASANGFVRGEGCGMWLKRLSDAQRHGDRIWALIRG3AMNQDGRSTG 

i.MAP^JVlAQEALLREALQSARVDAGAIGYVETHGTGTSLGDPIEVEALRAVLGPARADGS 

RCVLGAVKTNLGHLEGAAGVAGLIKAAIJU-HHELIPRNLHFHTI^PRIRIEGTALALATE 

PVPWPRAGRPRFAGVSAFGLSGTbAmWLEEAPATVLAPATPGRSAELLVLSAKSAAALD 

AQAARLSAHIAAYPEQGLGDVAFSI.VSTRSPMEHRLAVAATSREALRSALEVAAQGQTPA* 

GAARGRAASSPGKLAFLFAGQGAQVPGMGRGLWEAWPAFRETFDRCVTLrDRELHQPLCE 

VMWAEPGSSRSSLl^QTAFTQPALFALEYALAALFRSWGVEPELVAGHSLGELVAACVAG 

VFSLEDAVRLWARGRLMQALPAGGAMVSIAAPEADVAAAVAPHAALVSIAAVNGPEQW 

lAGAEKFVQQIAAAFAARGARTKPLHVSHAFHSPLMDPMLEAFRRVTESVTYRRPSIALV 

SNLSGKPCTDEVSAPGYWVRHAREAVRFADGVKALHAAGAGLFVEVGPKPTLLGLVPACL 

PDARPVLLPASRAGRDEAASALEALGGFWWGGSVTWSGVFPSGGRRVPLPTYPWQRERY 

WTEAPVDREADGTGRARAGGHPLLGEVrSVSTHAGLRLWETTLDRKRLPWLGEHRAOGEV 

VFPGAGYLEMALSSGAEILGDGPIQVTDWLIETLTFAGDTAVPVQWTTEERPGRLRFQ 

VASRE PGERRAPFRIHARGVLRRI GRVETPARSNLAALRARLHAAVPAAA I YGALAEMGL 

QYGPALRGLAELWRGEGEALGRVRLPEAAGSATAYQLHPVLLDACVQMIVGAFADRDEAT 

P WAP VEVGS VRLFQRS PGELWCHARWSDGQQAS S RWS ADFELMDGTGAWAE I S RL WE 

RLA3GVRRRDADDWFLELDWEPAALGGFKITAGRWLLLGEGGGLGRSLCSALKAAGHVW 

HAAGDDTSTAGMRALLANAFDGQAPTAVVHLSSLDGGGQLGPGLGAQGALDAPRSPDVDA 

DALESALMRGCDSVLSLVQALVGMDLRNAPRLWLLTRGAQAAAAGDVSWQAPLLGLGRT 

lALEHAELRCISVDLDPAEPEGEADALLAELLADDAEEEVALRGGDRLVARLVHRLPDAQ 

RREKVEPAGDRPFRLEIDEPGALDQLVLRATGRRAPGPGEVEISVEAAGLDSIDIQLALG 

VAPNDLPGEEIEPLVLGSECAGRIVAVGEGVNGLWGQPVIALAAGVFATHVTTSATLVL 

PRrLGLSATEAAAMPIAYLTAWYALDKVAHLQAGERVLIHAEAGGVGLCAVRWAQRVGAE 

VYATACTPENRAYLESLGVRYVSDSRSGRFVTDVHAWTDGEGVDVVIiDSLSGERIDKSLM 

VLRACGRLVKLGRRDDCADTQPGLPPLLRNFSFSQVDLRGMMLDQPARIRALLDELFGLV 

AAGAISPLGSGLRVGGSLTPPPVETFPISRAAEAFRRMAQGQHLGKLVLTLDDPEVRIRA 

PAESSVAVRADGTYLVTGGLGGLGLRVAGWLAERGAGQLVLVGRSGAJVSAEQRAAVAALE 
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AHGARVTVAKADVADRSQIERVLREVTASGMFLRGVVHAAGLVDDGLLMQQTPARFRTVM 
GPKVQGALHLHTLTREAPLSFFVLYASAAGLFGSPGQGNYAAANAFLDALSHHRRA.QGLP 
ALSIDWGMFTEVGMAVAQENRGARQISRGMRGITPDEGLSALARLLEGDRVOTGVIPITP 
RQWVEFYPATAASRRLSRLVTTQRAVADRTAGDRDLLEQLASAEPSARAGLLQDWRVQV 
SHVLRLPEDKIEVDAPLSSMGMDSLMSLELRNRIEAALGVAAPAALGWTYPTVAAITRWL 
LDDALWRLGGGSDTDESTASAGSFVHVLRFRPWKPRARLFCFHGSGGSPEGFRSWSEK 
SEWSDLSIVAMWHDRSLASEDAPGKKYVQEAASLIQHYADAPFALVGFSLGVRFVMGTAV 
SLASRSGAPAPLAVFTLGGSLISSSEITPEMETDITAKLFFRNAAGFVRSTQQVQADARA 
DKVITDTMVAPAPGDSKEPPVKIAVPIVAIAGSDDVIVPPSDVQDLQSRTTERFYMHLLP 
GDHEFLVDRGREIMHIVDSHLNPLLAARTTSSGPAFEAK* 

Seq ID No 87 

>Contig57_027 419 amino acids MW=46737 D pl=5.09 numambig=0 
MTQEQANQSETKPAFDFKPFAPGYAEDPFPAIERLREATPIFYWDEGRSWVLTRYHDVSA 
VFRDERFAVSREEWESSAEYSSAIPELSDMKKYGLFGLPPEDHARVRRLVNPSFTSRAID 
LLRAE I QRTVDQLLDARSGQEEFDWRDYAEG I PMRAI S ALLKVPAECDEKFRRFGSATA 

RALGVGLVPQVDEETKTLVASVTEGLALLHDVLDERRRNPLENDVLTMLLQAEADGSRLS 
TKELVALVGAIIAAGTDTTIYLIAFAVLNLLRSPEALELVKAEPGliMRNALDEVLRFDNI 
LRIGTVRFARQDLEYCGASIKKGEMVFLLIPSALRDGTVFSRPDVFDVRRDTGASLAYGR 
GPHVCPGVSLARLEAEIAVGTIFRRFPEMKLKETPVFGYHPAFRNIESLNVILKPSKAG* 

Seq ID No 88 

>Contig57_043 492 amino acids MVJ=52617 D pl=11.54 numambig=0 
MAARARKSCRARGSRPAPMRTSPPTSTPTPRPRGWRWTSFTSRRPSASRPAASGSSSGSV 
RAPSPRGNGSECSTTTLPAEEASWPRRPCRMPTSSCSSPSRSRAALSLHLLPPSSGRRPS 
GNIALSAALSPPAGPRALPRSPSPSYHPCVLLQRRPRRSNRRAGAALRARQESVLARPAH 
RWKDRRSMEKESRIAIYGAIAANVAIAAVKFIAAAVTGSSAMLSEGVHSLVDTADGLLLL 
LGKHRSARPPDAEHPFGHGKELYFWTLIVAIMIFAAGGGVSIYEGILHLLHPRQIEDPTW 
NYWUSAAAVFEGTSLIISIHEFKKKDGQGYIJ^AMRSSKDPTTFTIVLEDSAALAGLTIA 
FLGVVn:.GIffiLGNPYLDGAASIGIGLVIJ^VAVFIASQSRGLLVGSSADRELIJU^IRAI^ 
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ADPGVSAVGRPLTMHFGPHEVLWLRIHFDAALTASGVAEARERIETRIRSERPDVKHIY ' 
VEARSLHQRARA* 

12. DNA sequence according to any of claims 1 to 5 wherein the 
DNA is selected from the group consisting of 

(a) the following DNA sequences: 

Seq ID No 89 (>ContiglO) 

GGTAGTGAAATATGCTGTATTCAACAGAAAGCTTGATGAATTGATCTAGA 
AAGTAGAGCGAGAGAATCAAGTAAGATAGTAGGATGCATTATAAATATAG 
AATATATACTGCATACGATGACAGCATGCGCACGAATAGAATGCATAAGA 
GGCAAGCCAATAACCAAAAGTGGAGCCAGAGGAGATAGTCTCGCCAGTAG 
AAATAATGCTCAGCC7UVGCGAGGTTGGACATATCAGTTCCAGAGTAGGTC 
TCAACCCCGTATATGAGTCCAATGAAGCCTGTCTCATCCAGTTAACGGCC 
TTTTGAGCAGAGAATCCTCCCTATTTTCGGAGAGGACGCGTCGAATATAA 
AGCAGGTCCAAAGAAGCAAGCAATAGCCAAAAGTTTGAAAGGTTAGTACG 
AGCAGCGGCTGGAGGACACTATGGTCGTGCAACGGGGGTAAAGGGTTTCA 
CGTATTGTAGCAGAGCACGTCAGAGGGTTATTCGTGACATTCGAGGCCAA 
CGAGGCGGTAGGACTTCGTAAGCGCATGACCATCCCGGTCACAAACGTAG 
TGCGGAGCGCCTCGTCACGCTCAACAAGGCCCTAGAACGCGCGGCGCAGA 
TCGACCCTTTTAAACGCCGGCACCGAGCCGGACCGTCCTGCCCAGGTTGT 
AAAGCGCTCCATCGGCCGACTTATGGCACTCGAGCCAAATCGCCCGGTTC 
CCCATCGGTCAGCGCAAACGGCCCCCCCGGGCGTCGCCACCCGCGGCGAC 
GAGGGGCCGTCCAGACGGGTGATCTCTCTCGTGAGCTCGCGGAGAGAGCC 
TCCTCGCAAGATCGATGTCAGCGGGATCGCGCGCCCCGTCCGCACCTGAA 
ACGCGTGCTGGAGCTCGACGGCAGCGAGGGAGTCGAGGCCGAACCGCGAT 
ATCGGCAGCGCGTCGTCGATCTGCCCGGCGTCCAGACGAAGCGCGCGGGC 
GAGGGTCGAGCGCAGCGCGTCCAGCAGGCTCCGGCCGGAGGGCTCCTCGG 
TCTCCGGGGGCGCGTCGTCCGGGGGCGAGGCGTCGTCGAGGAGCTCCGGC 
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GCGAACGCGACGTGGCGCTCGCCGAGCGCGTCCTCGAGAAAGGCGCGCCG 

GCACTCCCTCCGGCGGACCTTCCCGCTCGACGTCTTCGGCAGCGCGCCCG 

GCGCGATCAGCGCGACGGCGTGCGCGACGAGCTG3TGCTCGGCGGTCACC 

GCCTCGCGCACGGCCGCCACGATCTCGCGCGGATCCGCGGCCACGCGCGG 

GTCGACCTCGCACACCACGGCGAGGCGCTCCTCGCCCTCGTGCTCCACGG 

AGAACGCGGCGCTGCAGCCCGGCCGGACGGCGCGATGGCTGCTCTCGACG 

GTCTTCTCGATGTCCTGCGGGAAGTGGTTGCGGCCTCGAAGGATGATGAG 

GTCCTTCGACCTCCCCACCACGAACAGCTCGCCGCCCCGGAGGAAGCCGA 

GATCTCCCGTGCGCAGGTAGCGCGGCGCCGCGCTGCCAGCGAGCGTGGCC 

CCGAACGTGGCCTCCGTCTCCTCCGGGCGCCCCCAGTAGCCGACGGCTAC 

GCTGGGCCCGGACACCCAGATCTCCCCGATCTCCCCCGGCCCGAGCTCGT 

TCCCCGCGGGATCGACGATCGCGACCGCCCGCGGATCGAGCGCCCGACCG 

CTGCCGACGAACACGCGCGCGCCCTCCGCCGCCGACGCGACGGCGCGCCC 

GAGCTCCACCTCCTCGGGGGCGAGGCGCGCCAGCACCGGCGCCTCGGCCC 

GCGCTCCGCCGCTCACGATGAGCGTGGCCTCGGCGAGCCCGTAGCAGGGA 

TAGAACGCCTCTCGCCGGAACCCGCTGACCGCGAAGGCGCGCGCGAAGCG 

ATCGAGCGTGTCGGCGCGCACCGGCTCGGCGCCCGTGAACGCGACCTCCC 

ACGACCGCAGATCGAGCGCCGCTCGCTCCTCCTCCGAGCTCTTCCGGACG 

CACAGGTCGTATGCGAAGTTCGGGCCGCCGCTCACCGAGGCGCCGAGCGC 

CGAGACGGCGCGGAGCCACCGCATCGGCCTCTGCAGGAACGAGAGCGGCG 

ACATGAGCGCGACGCGGATCCGCCGGTAGAGCGCCTGCAAGATCCCGCCG 

ATGAGCCCCATGTCGTGATACGGCGGCAGCCAGATCACCCCGACCGGATC 

CGGGCTCGTCAGGTCGAATCCATGCGCGATGAGCCGCGAGTTGTGCAGCA 

GATTCCCGTGGGTGAGCATCACCCCCTTGGGCTCGCCGGTCGAGCCGGAG 

GTGTATTGAAGGAACGCGACCGACTCCGGCCGGAGCGCCGCGCCCGGCCC 

CTCGATCGGGCCCGGCGACGGGCCGTCGGTCGCGATCCACCGGAGCCGCT 

GCAGCGCGGCGGCCGCGGCGCTGGCCGGCAGGGACGCCACGATGCCGGCG 

ACGGCCGATGACGTGAGCGCCGCCTCGGCGCGCGCGTCCGCGACGATGGA 

AGCGACGCGCGGCAGCGTCCGCTCGAGCCGGCCGAGATCCGGCGGATAGG 

CGGGCACGGTCCGGACTCCAGCGTAAAGACACCCGAAGAACGCGGTGATG 
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TACTCGATCCCCGGCGGATACAGCAGCAGCGCGCGGGCCCCGGGGGCGAC 

GCCCGATGCCTGCAAGAGGGCCGCGACGGTTCGCGCGCGCTCGTCAATTT 

CCCGCAGGGTCACCCAGGTCGCCCCGGCCTCGACGTCGCCGGACTCAAGA 

AAGCAATAGATTGGGCGGGCGGGCTCAGCTTCGGCCCGCTGGCGCAAGAG 

G7CGATAACGGTC-GAAGGGCGGTTCCGTTCGTTCCGTTCCAATGCAAGAA 

AAGCATCATTCATTGAACAGACCCCTCCGCCGCGGAGATAGCAGCTTGTC 

CGCTGCGACACAACCGCCGCGCGACGCGCGTGGCACGGCGGGATCCGGGC 

GTTACTCCACCTGCACTTCCCGTCGCGTCACGCTCGCTCCGCCGCGGGTG 

TCGTGAACCACCGCCCACAGCGACACGCGCCCTGGCTCCGAGGGCGGCGT 

CCACGTGGTCCCGTTGCCCCCGCGGGAGGCGCCGGTCGTATCGCTCACCA 

GGCGGCGCGCCCCGTCGAACTCGCCACCGTCCGTGTAATAGTCGACCCAG 

ATCGCCTCGCGCGCCGGTGGACCGCCGAGCCCGGCGGCTTCCTCGTCCAC 

CTCGGCGGCCTTCTCGGGCACGACAGCCTCAATCTCATAGGTCGTGCACT 

CGTCCTCGGCCGGCTCGGTCCGGCCGCAGCCTTGGGCCTGCTCCTCGGAC 

CGAACGCACCGCTTCACGACGGGCAAACCGTCCTCGCCCGGCGCGACCTC 

ATTGCCATCGAGCTrCAGCGTGAAGCCGTCGATGGGCGGGTTCGTGTTCA 

GCCGCTCCTTCTTGAAGACATAGACCTGCGTGTAGCCCACGACGAAGCTG 

TCCGGACCGAGCACCGTCCCGTCGTCGCCGACGCACTCCAGCGGAAACCC 

GGCCGTTTCGGGCGCCGAAGCCACGCGTGTCGTGCCGGCGCACACGGCGA 

ACAGCACGTAAGCCGACGAGTACACCGTCCCCGTCTCGGTGGGCCTCGCG 

TCCTTGAGGATCTCCTTGGGCAGCTTCCACCCGAACGAGACCGCATCGGG 

CTCGCCGCTCTTCTCCGGACCGATCTCCTGCTGCGCGAAGGGGACGGTGC 

GCTCCCCGTCGCCGCTGCCGCTGCCGCCGTCGCCGCTGCTGCCGCCGTCG 

CCGCTGGCACCGCCACCGCCGCCGCCGCCAGCGCCACCATTGCCGCCTTC 

GCCGCCTCCAGCGCCACCACTGCCGCCGTCGCCGCCGCTGCCACCATCGC 

CACCACTGCCGCCGCCGCCGCTTCCGCCGCTGCCGGCCGGCACCGCCTCC 

CGGATTCGCGACGATTCCCACCGCATAGGTGCCCAGCCACTGCGGGATGC 

ACCCGAGGTGCTCGTCCACCCCGACCGGCGGATTCACGCAGCCGCCCACC 

CACGTGACCTCGACCTTCCGCGGCGCGCCGCCCTCCGCGCCTTTCGCGTC 

GGCGTACGTCATCCGGAACGTCACGAGCTCTTCCGCCGCCGCGTACGGCT 
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TGTCCGCCGTCACGGCGAGGACGCGGAGCCCCTTCACCTCGGACGAAGGG 
GCCATGTCGCTCCCGGCGCAGCAAGGGATGCCCACGGCCAGGGTCGAGAG 
CGCGGCCAGCAGCGCGCGACGAGCGGGCAGTGCGGTCCGTTTCATCAGAA 
ATCTCCTCGCAGCCCGAGCGTGGGCAGGAAGGGGAGCCCCGTCACGTACT 
5 CCCGCTTCGTGTAGTTGAAGTTGTAGCTGATGCCCTCCGCAGCCATGTAA 
^5 TTGTAGACGTTCTGGATATCGAGGTAGAGCCCGAGCTGCCACCTCTTGAA 
TTTCCACGTCTTGTCGGCGCGGATGTCGAGCTGGTGAAACAGCGGCATCC 
GGTCGCTGTAGTCACCCCCGAGCGGGATCGGCGAATACCTCGCCGAGGAC 
GCGTGGTAGATCGCGTTCACCCGGTTCGGATTGCACCCCTTCTCCTCCGG 

20 

10 ATCGCAGACATAGGGCGTCTGCAGGTTGCCCGACACGAGCCGGAAGCGCG 
CGCCCAGCTCCCAGCCCCGGCCGAGCCGCAGGCTCCCGAGCACCGTCAGC 
ACGTGCGTCTGATCGAACTGGGTGAGGTGCTCCTCCTCGTCGGGGCCGTC 
25 CTTGCGCACCGACCGCGAGAGGGTGTACGCCGCCCAGCCGAAGAAGCGCT 
CGTCCGGCTTGTACTTCAACAAGAGCTCGCCGCCGACCGCGTATCCGGTG 
15 CCATCGTTGGCATAGTCGTCCTTCTCCGGCGAGAAGACGACCAGCCGATC 
GAGCTGCTTGTAGAACCCGTCCAGCGTCACCTCGATCTGCGGCGTGATCT 
30 CCTGCTCCACGCCGAGGCCGTAATGCACGGCGCGGTTCGACTTGAGCTCC 
GCATTGCCGAACGGCTCGATGCTCTCCGCGAACTGCGGCGCCTGATAATA 
AAGGCCCACGCCCCCCTTGGCCGTCGTCCGCGGGAAGCCGCTCCGGATGT 
20 CGTAGCGCGCGTTGACCCGCGGGCTCACGTCGAGCGTCTGCGTATCGAGC 

35 

GCGTAGTCGACCCGCACCCCGGGGACGATCCGCGCCCGCGGCGAGGGGAC 
GACCTCGAGCTCGGCATACGCCGCGGGCCGCGAGTACGCGCCGTCGAACG 
ACCGATCCTGGAACGGGTACGTCGAGAACGGCTGGTTCGACGGGTGGCCC 
40 GCGGGCTGCTGCGACGGCGCGCGGATGTTGACCGTGGCGACGCCGCCCGA 

2 5 GAGGTCGGTGCCGACGTTCATCGTGAGGTACCGCGCGAACCTGTGCGAGA 

GCTCCAGCCGCAGGTCGAGCGAGGTCGAGACGACGTTGAAGGCGAGGGGA 
GAGATCTCGAAGTCGGCGATGTCCCGGCCGAGCGCCATCGACCACAGCAG 
CGGATCCCGGCTCCCGATCCGGTTCTCGTAGCTGAGCTGGAAGCGCTGGA 
AGGCGGTGTGCAGCCCGAAATCGCCCGTCAGCGCCGGCTCGTCCTCCGGC 

3 0 GGCTTGTCCAGGGTGATCTTGAAGGCGTCGTCCGATCCGTAGAAGCTCGC 



55 
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GCCCACGCGCTCGCTCGCGGAGGGGCGGCCCTCGAGGACGAACTGGTAAT 
CATAGTAGACGGGCGCCTGCGTGACGCTGGAGCCCGCCTCCTTGAGCACG 
GGCCCGAGCCACGCGTCGACCCAGCTGCGGCGGCCCGCCGCGATGAACGT 
CCAGTCCTTGAGGAACGGGACGGGGCCCTCGAGGAGCACGCGCCCGTCGA 
5 TGAGGTCGAGCTGGACCACGCCGTGGTACTTGCCGTCCTGCTTCGGCGAG 
15 CGGAGCCCGACGTCGACGATGCCGCCCATGGCGCGGCCGTACACGGCGCT 
GAAGTTGCCCGGATAGAAGTCGATCTTCTCGAGCATCTCGGTCGGCACGA 
CCGAGGAGAGGCCGCCGAAGTGGTAGATGATCGGCACCGGGGTGCGATCG 
ACGAACGTGAGCGTGTCCTGGGGCGCGGACCCGCGCACGATGAGCAGCCC 

20 

IC GAAGCCGCTGCGCGCGACGCCCGGCAGGCTCTGCAGCGACCGCAGCGCGT 
CGCCGCCGGTGCCGGGGATGCGGTCGATCTCGCGGCGCTCGATCGTCCTC 
CGCGTCACCTCGCGCGGCGGGCGCTCGCCCTGCACGGTCACCTCGATGCC 
25 CGGCGCCTTGCCGTCCTGCGGCGCGGCGAGCGAGATGCGGTAGCGCACCT 
CGATCGCCTCGCCGGCCGCGATCTCCTCCTCGGCGGCGAACGGCTCGAAC 

1 5 CCCGCGGCGGCGACGTCGACGCGGTACTTGCCGGGGGGGAGATTCTTGAA 
GCGGAACTTGCCGCCCTGGTCCGTCTTCGCCTCCTCGCGGCCGCCGTCGG 
30 GGCGCACGAGGGTGACCGCGATGTCCGGGAGCGGCTCGCCGGTGCCCGCG 
GACAGGACGGTCCCGACCACCGTCTCGACGTCGGCGGGCGGCGCCGGCGC 
GGCCGCATCGGCGGGCTTGGGCGTGAGCGTGAACGCGTACCGGTAGAGGA 

2C TGCGCGCCGCCGCGGGCGTGCCGTCCGGGCGCCGCGCCGGCGCGAACTCC 

35 

AGGCCGGGCGCGGCCCGCGAGCGCCGCCTCGTTGAAGCCGTGCCCGCCGG 
GCGTCGCGACCTCGGCCTTGGTGACGCGCCCGGTCTTGTCGATGTCGAGC 
TTGAGGATGACGCTGCCCTCGACGCCGGCGCGCTGGGCCTCGATCGGATA 
^ CGCGGGCGGGGAGTACTTGATCAGCGTCGGCGGGCTGATGGCGGCGGGCG 

25 CCGGCGGGGGCGCGCCCGGCTGAGGGACGACGACCGCGCCGGCGCCGCCG 
CGGGGGACCGAGGCGCCGCCCGAGTCGCCCTCGGCGGCAGGAGGAGGCTC 
GGGCGGGGGCGCGCCGGCGGGCTGCGCGCGCGCTGCGCTGCCGGTCATCG 
CGACCGCGAGCAGCAGCGCTTCCGAGACGACGAGGCGCATCACGGAGGAC 
GCTGTGGAAGGCATGCGGCCCGCCCTCTCGCATGGCGAGGCCGAGGCGGA 

3 0 AAGACGCATCGCGCAGCCAGGACCGTGCTTCACATTGCTTCACACAACGG 



55 



VVO 00/22139 



PCT/US99/23535 



191 



10 



GCGCCGCGCGCGCTCCCGGGCCGCGC3AGCGCAGGCGGCGCGCGCGCCCG 
CGGGCGGCGCGATCGCGAGCGGCGCGCGGTGCGATCAGCCGCCGACCTCG 
GCCACGAACCGGCTCACGTCGTCGCTGTCGCCCACGAGCAGCAGCGTGTC 
GCCGTCGCGGATCACGTAGTCCGGTGTGGGCGCCTCGAGCCGCGGCTTGT 
5 CGCCGGGCCGCTTGTTCGTGTGCGGCCGCACACCGAGCACGTTGATGCGG 
15 TACCGCTGGCGGATCTTCGAGCCGGCCAGCGTCTGCCCGACCAGCGGCCC 
GTGGGCGTTCCAGGGGACCACGCGGTAGTGGCTCGCGAGGTCGAGGAGGT 
CCTGCGCGAGCGGCATGGTGATGTCGGCGCCGACGCGGCGGCCCATCTCG 
GTCTCGAGCTGGATGACGCGGGTCGCGCCCACCGCGCGCAGGATGTCGGC 

20 

10 CTGGCGATCGGTGGCGGCGCGCGCGATGATCTCGCGCACGCCCATCCGGA 
• CGAGGGAGGCCACGCAGAGCACGGACGGCTCGAAGTGCTCGCCGAAGGTC 
ACGATCGCGGTCTCCACGTACTGCGCGCCGATCCCCTCGAGCACCTTGTG 
25 GACGGTGGCGTCGCCGACGAACGCGGCCGAGGTCTTGTCCTTCACGGCGT 
CGACGGCCTCCGGGTTGTTGTCGACCGCGATCACCTCGGCCCGGTTCTTC 
1 5 CAGAGGGTCTCGACGACCGACGTGCCGAACCGCCCGAGCCCCGATGACGA 
GGACGCTCTTCGATTTCATGGTCTCCGGTCGCGCGCCGCCCCCTCGGGGC 
GCGGCGCCGCGAAGGTCTCACGGATGCGCCGGGAGCGCCACGGTTCGGCG 
CTGCCGTCGTCGCGACGGCGCGGCCCGCGCGGGCCGCGCCGCGCGGCGCT 
CAGTAGAGCTCGTCCTGGTGCTGCCACCGCTCCGAGATCCACGCCTTCAG 
20 GTACGCGATCTCCTCCTCGTACGTCGTGAAGTCGTCGCGCCAGCTCCAGC 
CCTCGTAGCTCCGGTACGCCTCGCCCCACCGCGCCTCGTCCCGGCGCGCG 
CTCGCGTCGATGCGCTCCACGTAGCCGTCCACGATCGCGTGGATCTCGGC 
CTCGGCGAGCGCGCCGCGCAGGACCTGATCGTAGCGGGCGCGCAGCGGGT 
40 CGCCGATCGACGGCTCCTCGAGGAGGCGCTCGAAGAGGAGGTTCACGTCG 
2 5 CGGTAGTCGACGCGATCCGACGCCGGCTCGCGCTCGGTCTCCCACGACTG 
GCCGAAGCTCGCGTTGAAGTCCCACGGCGCGTAGCGGAATACGCCGTCCG 
CGGCCGGATCGCGGTAGTGGTAGCTGTTCTTTCCGGCCGAGTCGTTGGCC 
ACGATGAACGTGACGAAGATCCACCAGTCCTCGTAGTCGCGCAGATCGAT 
CCGCGACCCGATCTCGGCGGCGAACGTGGCGTCGTCGGACTCGGCCACGA 
30 AGCTCACGAGATCTTCCAGATCCGAGAACGCCTCCGGCTCGCCCTCGGCC 



30 
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GGCGCCCCTTCCTTCTTCTCGAAGCCGTCGTGCAGCGTGTCCTTGGGGTC 
GCCGGACCGGTCGGTCAGCGCGAAGTTCGCGTCGTGGCTGACCGCCTTGT 
AGAGGTTGCCGTCCTGCGGGTAGCCGTGGTCCTCCATCAGGTAGCCGTCG 
ACGTGATCCGCGACGGTGTAGAGCCCCGCGTACTCCCCGTCGAGGTACAG 
GACGGCGCTGTAGGTCTTGATCTGGATGTGCTCGGGATCGAGGCGGTTCC 
AGAGGTCATAGGCGAGGCGCTGCCGGACATAGGAGTTGTCGTCGAACGTC 
GTGATGAGCACGACCTTGCGGCGATCGGTGAAGCCGCCCGCCTCGTCGGG 
CTCGTTGAACTTGTCGTCCTTGGGGAACTTGAGGGTGTAGCTCCGCTTCG 
GGTACGAGAGCGAGCTCTCGCCGCGGAGCTCCGCCTCCGGGGCGTACGTG 
TGGCCGCGGTAGATCACCGTGGCCGGGGCGTACTCCTTGTCCTCGGGGAC 
GGGCGAGAGGAAGAGCACCGGCAGGCCGTACTCCTCGGGGTAGCGGGTCG 
GATCGACGACGGGCACGTTCGACGGATCGGCGAAGGCGTCGGCGACGCCG 
ACCTTGACGCGCCCGACCTCGGACGTCTGCGCGACGCGGATCTCGATGTC 
GTAGACGGCGGCCTGATCGAGCCCGGGCGAGAACGTCACCTCGCGCGCGA 
TCGGGTCGTACGCGGCGCCCTCGGGGAGCGGGCCGACCTCGAACGCGTCG 
CCGGCGAGCGCGAGGCCGCTCGCGCACGTCACCGGGAACGTCACGGTCTC 
CCCCTCGAGGAGCCAGTGCGGGCCGCCGCCCGACGGCTGGCAGCGCGAGC 
CCTCGGCGCTGGAGCCCCCGCCGCTGGAGCCGGAC 

Seq ID No 90 (>Contigll) 

GGCGACCCCACATATCACATAGTAGAATCAGTGTGAGTTAGACAAATGTC 
GAGTGATGAGAAGGACAGAAGTGAGAACTCTGTCGATCACTGTAGAACGA 
GAGAGTATGAGCCTGCATACATGATAGCGGACATGAGAACGAGTGTANTA 
TGATGCTACTAAGAGAGTAACAGATCAGAGACTAGAGTAGAGCAATAGAA 
NTCAGAGATAAGTCAATGACGAGGAGTAGTGATAGAGCTCTTAATAATGG 
CTGAGGTCGAAGATAGAAGTGCATAGAGCGATAGATATACAATCGGTTGA 
AGCAGAGAGTAAGATAAGATCAGACACNGAGTACAGAGAGAGACGAATAG 
ATGGCGTGATNTCACAGAGAGGTGCGAGCGTAGCTGACGAGAGCAGAGAC 
GCAGAGTAAGTCACACCTAGATAGTTACGGCGAGAGACAAATGATAGGAA 
GGAC-TGGACGAGATCAACAGNCCGGAGCACAAGAACGTGAGATGCGACCG 
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TGTAATAAACAGGAGACAAGAGCGACTACATAAGAGAGCGAAGCGAATAG 

ATAAGATATAAGCCCAGAGCAAAATAGAAGGAGAGAGAGAGTATTTGTAA 

TAAAGCAACAAGACGGAGAGAGCGAAGCAGCAGGCAACGATTAGAAGAAA 

GACGACAGGAAAGTGAAAGCGAAAGAGAGCAGGTAGAAAGAGAACCAAAA 

AAGCACGAAGGAAAAGGAAGCTTCTATGATAGGTGCGGGACAAGGCGTAG 

CTACAGGAGACAGCCGGCATACGAGGAGCCGGTAAAAGCTAGCCTTTCAG 

AACACATCGGGAGCGCGTAAAGGCGGACCACGCTCGACGGGATCATGTAC 

GCCGACAGCGACGCCTTCAGCCCCGCGCGCACGTCCGGCGCGTCGCCCGC 

CGCGTCGCCGTCGAGCACGACGTAGGCGACCAGGCGGGCGTCGCCCGGCG 

CGTCCTCGCGCAGGACCACGGCCGCCTGGCCCACGCCGGGCACGCGCCGG 

ATCTGCGCCTCGACGTCGCCGAGCTCGATCCGGTGCCCCCGGA3CTTGAT 

CTGGTGGTCCGAGCGGCCCTGGAACTCGAGCATCCCGTCGGGCAAGAAGC 

GCGCGACGTCGCCGGTCCGGTACATCCGCCCGCCCGCGGCGCGCGCGCAC 

GGGTCGGGCAGGAAGCGCTCCGCGGTGAGCCCGGGCTGCCCCACGTAGCC 

GCGCGCGAGCGGCGCGCCCGCGATGTAAAGATCGCCGAGCGCGCCGATGG 

CGGGGCGGCGCATCGCGCCGTCGAGCACGAACACCTCGGCGTTCGCGACC 

GGCGCGCCGAGGGGGACCCACGTGACCCGCGGGTCGCTCGGCAGGACGCA 

GCCGGTCACCGCGATCGCGGCCTCGCTCGGCCCGTACATGTTGATGAGGT 

CGCCGTCGTGCTTCGCGTAGAAGCGCCGGACGAGATCGAGCGGCACCGCC 

TCGCCGCCCACGAGGACCTTCCGCAGGCTCGCGGGGAACGGCTGCTCGGG 

CCCCCCGAGGAACGCCGCGAGCATCGAGGAGACGAAGTACGCGGTCGTCG 

CCCCCTCGTCGCGCACGAGGCGCCGAAGGTACTCC'GGATCGCGGTGCCCG 

CCGGCCCGGGCGACGACGATCCGCGCGCCGAACGAGAGGGGCCAGAAGAT 

CTCCCAGACGGAGACGTCGAAGCCGAACGCGGCCTTGAGCAGGACCCGGT 

CGTCCGCGGTGAGCGCCCAGTACCGCTGGATCCACTGCATCTGGTTGACG 

ATGGCGCGGTGGGAGATGAGGCTCCCCTTCGGCGTGCCCGTCGATCCGGA 

CGTGTAGATGACGTACGCGCCGCTGTCCGGCGGCGGGCTCACGGCGGGCC 

GCGCGTCGGAGCACGCGGCGATCTCGGCGGCCTCGGCGTCGAGGAGCAGC 

GTCGTCCAGCCGCCGGTCGGGAGCTCGTCGGCGATCGCGTCGTGCGTGAC 

GAGGAGGCGCGCCCGCGCGTCCCGCATCATGAAGGCGAGGCGCTCGCCGG 
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GGTACTCGTGGTCGAGCGGCAGGTAGGCGCCGCCCACCTTGAGCACCGCC 
,0 AGGGTCGCGACGACCATGTCCTCGGAGCGCGGCACGCAGACCCCGACGAT 
CGTGTCGAGCCCGACGCCGCGGCGGCGCAGGCAGCTCGCGAGCCGGTTCG 
CGCGCCGCTCGAGCTCGCCGTACGTGAGCGACTTGCCCTCGCTGCGCACC 
5 GCGACGACGTCGGGGTGCTGCTCGGCGCGCTCCTCGAACCACCGGTGCAG 
15 CGCGCAGGCCGACGGCAGCTCCATCGCGGGGCCGCGCGACCACGCCTCGA 
TCTCGGCGCGCTCGCCGGGGCCGACGTACTCGCCCTGGGCGACCGGACGC 
TCGGGGTGGCGCGACAGGTCCTCGAGCAGCGCCGCGAGGCGCTCGGCGAG 
GCGCTCGGCGTCGCGGCGCGCGCCGGCCGACGCGTCGTAGCGGAGCTCCA 

20 

10 GCGACGCCGACGGGCCCGCGCCGGCGCAGTGCAGCCGGCAGGCGACCTGG 
TCAGACGTGCTCCAGACGTCGAGCACGCGGGCCCGCGCGCCGTCGAGCGA 
CAGCGCCGGCGCGGCCTCCGCGGCGGAGAAGCCCCCAGCTCATCCGGTGG 
25 CTCACCCCGGGCGCGGCGTCCTGGTGCGCGGCCGCCTCGGCCTCGGCGAG 
CGCGAGCCGCCGCGCGACGTCGGCGAGCGTGTCCGAGGCCGAGATCTCGA 
1 5 TCCGCACCGGCAGGAACCGCGCGAACGGCCCCACCGCGCCCGCGAGCGCG 
TCCAGCGACCGCCCGTCGAAGCGGACGGCCACGGTGACCTCGGGCTCGTT 
30 GCCGCCGCTCATCCGCCACAGGAGCGACGCCCACAGGGCCAGGAGCACGA 
TCCGCTGCGGGACCTGCCACGACGACGACCAGCGCTCGACCTGCGCCATC 
CCGCCTTGTCCCAGATCGACCCGCGCGCGCCCCGAGCCGGCGCCGGCGCC 
20 GGCGCCGCCGCGGCTGAAGGCGAGGTGGAGCGGGGGCCCGAAA-GCGAGC 

35 

GGCGCTCGGCCCAGAACCTGCGCCCGTCGCCGGCGTCCTCCGACTCGAGC 
ATCCCGTTGAGCCACTCGGCGACGTCCGCGTACTGCTGCTCGGGCGGCGC 
GCCCGCGCCCGCGGTCGACGCGCAGAGCTCGCGGACGAGCGGGGCGATCG 
^ ACTCCTCGTCGACGCACCACGCGGGCGCCGCGAGCACGAGCCGGCGCTCC 

25 TCCGGGCCGACGCGGACCAGGCCGACGCGCAGCCCGTCGTCCGCGCCGCG 
GTCCTCCGAGAGGCGCGCGACGAGCCGCGACATCCGCTCGCCCTGCTCGG 
CTTCGGAGCACCCGACCCAGTCGTCCTGCTGACGCCCACGCGAAGCGCGG 
CTCGCCGACCACCTGCGCGGCCTCGCCCGCCCCGCCTCGACGAGGCGCGT 
GCGCAAGATCTCGTGCCGCTCGGCCAGCGCGAGCGCCGCCGCCGAGAGCC 

30 GTCCCTCGTCGCACGGGCCGGTCACGGCGACGACGGCCAGCGTCCGGCAC 
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CCGGGCGCCCCCGCCTCCCGGTCGAGCGCGCGGATCGCCCGCTGCTGCGG 
iO C^GAGGCTGAAGCCGGTCATCTCGTGGTCACTCATCCAGGTCGTCCTTC 
GGTGAGGTCTTCGCTTCGCCCGGCGCGCCCGGGCGCGGGAGGGTCACGGC 
GCGCCGGCGCGCGGAGGTCAGCTTGTCGAGCGCGGCGCCGCGCGCGGCCT 
5 TCCGCTCGAGCTCCCGGCGGGCCGCCGCGGCGCGCTCGAGCTCCCCCCGG 
^5 AGCTCCGAGACCGGGGTGTCCGGCCGCGCCGTCGCAGTGGCCAGGATCTG 
CCGGTAATCGCTAAGGAAATTGTCGACCGTCGCCGCCCGGTACAGCTCGC 
TGCTGTGCTCGACGCCGAAGCGGAACGAGCCGCCGGCCTCGGCGACCGTG 
AGGACGAAGTCGAACGCCGTCGTGGTCGCCTCGCCCTCCAGCGCCTCGAG 

20 

1 0 CTCGAGCCCCTCGAGCTTCATCGGGGGGACGTGCACGTTGCGCATGACGA 
ACTTCGCGTCGAAGAGGGGCACGTGCCCGACGGCCCCCTTCGGCCGCAGG 
GCCTCGACGAGCCGGTCGAACGGCAGGTCCTGGTGCTCGAACGCCTCGAG 
25 CGCGACGTCGCGCACGCGGCGGACCAGCGCGCCGAACGTCGGGTCGCCCC 
CGCAGTCGGTCCGGAGCACGAGCTGGTTGACGAAGAAGCCGATCATCGGC 

1 5 TCGGTCTCGACGCGGTTCCGGTTCGCGACGTCGGTGCCCACGACGAGGTC 
CTCGAGCCCGGTGCGCTC-GTGCAGGACGAGCTTGTACGCGGCGAGCAGGG 
CCATGAAGGGGGAGATCGCCTCCCGCTCGCAGAACGCCTTGATCTGGCGG 
GTGAGCTCGGCCCCGGCGTCGAGGCTCCGCCGCGCCCCGCGCCACGTCCT 
TCGCCCCGCCGGCTCGTGGTCGACCGGCACGCGGGCCCGGCGCAGCGCGC 

2 0 CCGAGAGCTTCGTCGTCCAGTACCGGAGCTCGCCCTCCAGGACCTCGCCG 

35 

GACAGCCACGCCCGCTGGGCTGCGGCGAAGTCGACGTACTGCGCCGGGAG 
CTCCGGCAGCCGGGAAGGCTGGCCCTGCGCGAAGCCGCCGTAGAGCGCGG 
CGAGCTCGCCGACGAAGACGCCGACCGACCAGACGTCGAACACGACGTGG 
40 TGCACGACGAGCGCGATGACGTGCTCGTCGTGGCGCTTCCGGATGACCCG 

2 5 CACGCGGAGGAGCGGCCCGCGGCTCAGGTCGAACGGCGCGAGGCTCTCCT 

CGAGGACGAGCGCCGAGACCGCCGCGTCGAGGGCCTCGCCCGCGAGGTGC 
TCGAGGTCGGACATCCGGAACGGCACCCGGGCCTCGGGCGCGACGACCGG 
''^ GAACGGCACGCCGTCCCTGGCGCTGAACGTCGTCCGCAGCGCCTCGTGGC 
GCCGCGCGATCTCGAACAGGCTGCGGCGGAGCGCGTCGACGTCGAGCCGG 

3 0 CCCGTCGCGCGCACCACGAACGGGATGTTGTACGCCGGGCTGCCCGGCTC 
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GAGCTGATCGACGAACCACAGCCGGTGCTGCGCGAACGACAGCGGGAGCG 

GGCCGTCGCGGGGGATCCGCGCGATCGGGGGGAACTCGCGCCGCCGCGCC 

TCGCCGCGGCGCGCGGCGTCGACCTGGGCCGCGAGCGCCGCGACCGTCGG 

CCCCTGGAAGAGCGCCCGCAGCGGGAGCTCGACGCCGAGCTGCGCGCGGA 

TCCGGGACATCACCTGGGTCGCGACGAGCGAGTCGCCGTGCAGGCCGAAG 

AAGTCGTCGTGGACGCCGATCTCGTGGACGCCGAGGAGGGCGCTCCCAGA 

TCGCGGGCGATCGCGCGCTCGGACTCGGTCGACGGCGCGGCGAACGCGGC 

CCCGGCGTGCGCGCGGGAGACCGCCGACGTCGGGAGGGCGTTCGGCGCGG 

GCGCGAGGGGCGCGTCGGCCGGCGCCGGCTCGGCGGCGGGGGCGCCGCGG 

CGCGGCGCGATCCAGTGCCGCGCCCGCTCGAACGGGTACGTCGGCAAGCG 

GACGAGCGCGCCGGGGGACGCCCCCGCGGACGGGCCGTCCAGTCGACGGC 

GTGGCCCGCCTCCCAGAGCTGGCCGAGGGCCTCGGCCAGGCTCGCGGGCT 

CGGACGCGGCGTGGGTCGACCCGAGGCTCGCGATCGCGGCGCCGCCGCGC 

CCGGCCAGCGTCTGCCGCACCAGCGTGGTCAGCCCGCGGCCGGGGCCGAC 

CTCGAGGAACAGGGCGTGCCCGGACGCGAAGAGCGCCTCGACGCCGTCGC 

TGAAGCGGACCGGCTGGCGGAGGTGCCGCGCCCAGTAGGCCGGATCGGTC 

GCCTCGGCGTCGGTGAGGAGGGCGCCGGTGACGTTCGAGACCACGGGGAT 

CTCCGGCGGGGAGAGCCGCGCGCGCCGCACGCTCTCGAGGAACGGGGCCA 

CCGCGCCGTCGATGAGCGCGCAGTGGAACGCGTGGGACGTCTGCAGCGGC 

CGGGCGAACACCTCGCGCGCCTCGAGGCGCGCGGCGAGATCGCGGATCGC 

GCTCGCCGGGCCCGCGACAACCGTGAGCTTCGGGCTGTTGACCGCGGCGA 

TCTCCAGGCCGGCCTCGAGGAGGCCCTCGACGTCCGCGGCCGGCAGGCCG 

ACGGCCAGCATGCTCCCGGCCGGCGCCGCCTGCATGAAGCGCCCCCGATC 

GATGACCAGGGACATCGCGTCCTCGAGCGTGAACACGCCCGCGACGCAGG 

CCGCCACGAGCTCGCCGAGGCTGTGGCCGATCATCGCCGCGGGCTCGATC 

CCCCAGCTCATCCAGAGCCTGGCGAGCGCGAGCTCGACGGCGAAGAGCGC 

GGGCTGCGCCAGCGCGGTGCCGAGCAGCGTGCGCCCGTCGCCCTCGCCCT 

CGCGGAAGACGACCTCGCCGAGATCGAGGCCGCGCGCCCGCGCCGCCGCC 

GCGCACGCGTTGAAGGCGCTCCGGAACGCCGCCTCCTGCGCGTAGAGCGC 

GCGGGCCATCCCGACGGCCTGCGCGCCCTGGCCCGGGAACGCGAAGACGG 
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GCGCGGCTCATCGGGGCGCGCGAGCGCGCTCGCCCCCTCGCGGGCGAGCC 

CCTGGATCGCCTCGGCGCGCGTCCGGGCGACGACCGCCCGGCGGTACGGG 

TGCTCCGCGCGCCCGGTCTGGAGGGTGAACGCGACGTCGTCGAGCGGGAC 

GTCGGTCGCCTCGAGGTGCGCGGCGAGCTGCGCGCAGGCCGTCGACAGCG 

CCTCCGGCGTGCGCGCCGAGAGCGTCAGCACGTGATCGCGCTCCGGGGCC 

GGGGCGCGGGGCGGCAGCGGGGGCGGCTCCTCGAGCACGACGTGCGCGTT 

CGTCCCGCCGATCCCGAACGAGCTCACGCCCGCGCGGCGCGGGCGGAGCT 

CGCGGGGCCAGGGCGCCGCCTCCCGCGGGACGAAGAACGGGCTCGCCGCG 

AGGTCGAGCTTGGGGTTCGGCGCCTCGAAATGGACGCAGGGCGGGATCTC 

GCCGCTCCGCACGACGTGCGCCGCCTTGATGAGGCCCGCGACGCCCGCCG 

CGGCGTCGAGGTGGCCGATGTTCGCCTTGATCGAGCCGAGCGCGCAGTAC 

GCCTTCCTCGGGGTCTTGCGGCGGAAGGCCTGCGTGAGCGCCTCGACCTC 

GATCGGATCGCCGATCGCGGTCGCGGTGCCGTGGGCCTCGACGTAGCCGA 

TCGAGCCGGGATCGACGCCGGCGACCGACTGCGCCTCGGAGATCGCCGCC 

GCCTGGCCGTCGACGCTGGGCGCCATGAAGCCGACCTTGCGCCCGCCGTC 

GTTGTTGACGGCGGAGCCCCGGATCACCGCGTGGACCGTGTTTCGGTCGC 

GGAGGGCGTCCGCGAGGCGCTTCAGCGCGACGATGCCGACGCCGCTGCCG 

CCCACGGTCCCCTCGGCGCGCGCGTCGAACGGCCGGCAGCGGCCGTCGGG 

GGAGCAGATGCTGCCGGGCACGTACGGATACCCGCGCTTCTGCGGGATGC 

CGATGGAGACGCCGCCCGCCAGCGCGAGATCGCACTGGCCGCCGAGGAGG 

CTCTCGCACGCCATGTGGACGGCCACCAGCGACGTCGAGCACGCGGTCTG 

CACGACCACGCTCGGCCCGTGGAGGTCGAGTTTGTACGAGACCCGCGTCG 

CGAGGTAATCCTTCTCGCTCGCCAGCATGAGCGCGTGCGGATCGACGGTG 

GCCGCGAGATCCGGGTGCGAGAGGAGCTGGAGGAGGTACGTGTTGGAGCC 

GCACCCCCCGAAGACGCCGATCGCGCCCGGGAACCGGGCCGGATCGCAGC 

CGGCGTCCTCCAGGGCGGCGACCGCGCACTCCAGGAAGAGGCGCTGCTGC 

GGGTCCATGAGCTGCGCCTCGCGCGGCGAGTACCCGAAATAGGACGCGTC 

GAAGCGGTCGATGTCGTCGAGCAGGCCGCCCGCGCAGACGACGGGCGCCC 

CGGGGGCCGCGCTCGCGCCGACCGGCGGCTCCTCGCGCTCGCTCTCCGGG 

AAGCGCGCGATCGACTCGACGCCGCGCCGCACGTTCTCCCAGAGGGCGTC 
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GACGCTCGGGGCGCCGGGGAAGCGGCCCGCCATGCCGACGATCGCGATGT 
CGCTCCCCCCGTCCTCGGTCTCGATCGGCTCTGACATGGCTATCCTCGCC 
CCCGGCGGCGTCGCGCGTCGCGGCGCGCCTCGGCGCGCTGCGCCCCGACG 
TCGGCCGGCTCGGCCTTGACCGTCGCCGCGTCGAGCCGCTGCGCCAGTTG 
5 CTCGATGGTCGGGTACTGGAACAGGTCGGTCAGCGACACGGCCTGCGCCG 
15 CGGCGCCCTCGTCGGGCGCGCGCGCCGCGATGCGCTCGGCGAGCAGGCGC 
TGCGCGCGCACGAGGAGCAGCGAGGTGAAGCCGAGCTCGAAGAGGTTGTC 
GGTCACGCCGACGGCCTCGACCTGCAAGACCTCCGCGAGCACCGAGGCGA 
TGAGCCGCTCGGTCGCGGTCCGCGGGGCGACGGCCGCGGCGCGCGGCGCG 
10 ACCGCGGCGGGATCCGGCAGGGCG3CGCGGTCCACCTTGCCGTTCGCGCT 
CAGCGGCAGCGCCGGGAGGACGACGACCTCCGCGGGGATCATGTACTCCG 
GCAGCTTCTTCCGGACGAAGTCGCGGAGCGCGGCGCCATCGCCGTCGGCG 
25 CCGACGACGTACGCGACCAGGCGCTTCTCGCCCGACGGATCGGTCTTCGC 
CGCCACGACCGCCTGCTCGACCGAGGGGTGCTGCGCGAGGGCGGCCTCGA 
15 TCTCGCCGAGCTCGATGCGGAAGCCGCGGATCTTCACCTGATGGTCGGTG 
CGCCCGAGCAGCTCGATGGTCCCGTCGGCGAAGTAGCGGCCCAGGTCGCC 
30 TGTCCTGTACAGCCGCTCGCCGGTCGTGGGGTGCTTCAGGAACCGCTCCC 

GGGTCCGCGCCTCGTCGCGCCAGTATCCGAGCGCGACGCCGATCCCGCCG . * 

ATGTGGATCTCGCCGGGGACCCCGATCGGACACGGCTCCAGCCCCTCGTC 
2 0 GAGCACGTAGGTGTGCTGGTTGGCGAGCGGGCGGCCGTAGGGGATGCTGC 
GCCACGCCGGGTCGACGTCCGCGATCGGGTGGGCGATCGACCAGATCGAC 
GCCTCGGTCGCGCCGCCGAGGCTCACGACGCGGGGCGCGCGGCAGGCCGC 
GCGGATGCGATCGGGGAGCTTCAGCGGGATCCAGTCGCCGCTCATCATGA 
40 CGAGGCGGAGCGACGACAGCGCCGGGTCGCCCGCGCCGGGGGACGCGTCC 
2 5 ATGAGCATCTCCATCAGCGCCGGGACCGAGTTCCACACGGTCACCCGCTC 
GCGCTCCACGAGCTCGCGCCAGTGCCCCGGATCCGAGGCGCGGGTACGGT 
CGGGGATCACGACGGCGCCTCCGGCGGCGAGCGTCCCGAACACGTCGTAG 
AGCGACAGGTCGAAGCTCAGCGACGAGAGCGCGAGCACCCGGTCCTCCGG 
GCCGACGTCGAAGCGGCGGTTGATGTCGAGGACCGTGTTCACCGCGCCGC 
30 GGTGGTCGATCATCACGCCCTTG3GCAGCCCCGTGGACCCGGACGTGTAG 
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ATCACGTAGGCCAGGTCGTCCGTGCTTCCGCCGGGCGGCCGGCGCGCGAC 
GGGCTGCTCGCGCCACCGCTCGTCCGCGTCGACGGCGAGGCGCTCGATGC 
CCGCGGGCCAGGCGATCGTCCCGTCGACCGCCGACTGCGTGAGGACGAGG 
CGGACCTCGGCGTGCTCCAGGAGGTGCCTGAGGCGCTCCTCGGGGAGGCG 
AGGGTCCAGGGGCAGGTAGGCGGCGCCGGCGCGCAGCACGCCGAGCACGG 
CGGCCACCTGCTCCCAGCCCTTCTCCATGACCACGGCGACGAGCGCGTTC 
GCGGTCGCTCCGGAGCGCGAGGCCGCCGCGGCGATCGCCTCGGCGCGCCG 
GGCGAGCTCCCCGTAGGTGAGGCGCCGCTCGGCGTCGACGACCGCGCACG 
CGTCGGGCTGCTCGACGGCGCGCTCAAAGAACGGCTCCTCCAGCCGGAGG 
TGATCCGGGGTTGCGACCGCGGTGTCGTTCCACGCGACGAGGGCGCGCTC 
GCGGTCTTCCGGCGCGACGGAGAGCGCGCGGACGCGCTGCGCGGGGTCCT 
GAGTGGCGCGCGAGAGCACGCTCTGCATCGTGGCGAGCATCCGGTCGATG 
25 GTCGCCGCGTCGAAGAGGTCGACGTTGTACTGGAGCGAGATCACGTCGCG 
CCCGCCGCGCGGCTCGACGCTGAAGCGCAGGTCGAAGCGCGTGGCCTCGA 
15 CCGGGAGATCGAGCGGCTCGATCCGCACCTCGCCGAGCTCGAGCGCCTCG 
GTTGGGGCGTTCTGCACGACGAGCATGACCTGGAACAGCGGCGAGCGGCT 
30 CAGGTCGCGGCGGGGGTTGACCGCCTCGACCACCTTCTCGAACGGGGCGT 
CCTGGTGCTCGAACGCCTCGAGCGCGACCTTCCGCGCCCGCGAGAGGAGC 
TCCTCGAAGGTCGGGTCGCCGCCGAGGTCGAGGCGCATGACGATCGTGTT 
20 CACGAAGAAGCCGACGAGGGGCTCGAGCTCGGGGCGAGGCCGGTTGGCGA 

35 

CCGCGGTCCCGATGGCGAGGTCGTCCTGGCCCGAGCTGCGCCGGAGGAGC 
ACGCCGAGGGCGGCGAGCAGGACCATGAAGCGGGTGGCGCCGCGGCTCCG 
GGCGAGCTCGTCGAGCTGCGCCACGAGGCGCGCGTCGAGCGGGAGGACCC 
^ GCTCCGCGCCGCGGAACGTCTGGACGGGCGGCCGCGGTCGATCGGTCTGG 

25 AGCTCCAGGACCGGCAGCCCGCGGAGGGTCGCTGTCCAGTGAGCGAGCTT 
G7CGGCGAGCCGCTTCCCCGCGAGGTGGCGGCGCTGCCACACCGCGAAAT 
CGACGTACTGGAGCGGCAGCTCGGGCATGTCCGCGGGCCCGCCGCCCCGC 
GCGCGCCGGTAGAGCTCCGCGAGATCGCGGACGAGGGGTTGGAAGGACCA 
GGCGTCCGTGACGATGTGGTGCGTGGACAGGACCAGGACGCAGACGTCGT 

3 0 GGTCGAGGCGGAACAGCCTGGCGCGGAACACGGGCCCGCGCGCGAGGTCG 
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AACCCCGTGGCCTGCTCGCGCGACGCCCAGGCGCGCGCCGCGGCCTCCGC 

CTCGTCCGGGGGCGTGCCGCGGAGGTCGACCACCTCCGCGGGGGCCGCCT 

CGGGCTCGCAGATCTTCTGCGCCGGCGTGGGGCTCGCGACGAACACCGTG 

CGCAGGCTCCAGTGCCGCCGGACGAGCGCGGCGAGCGCGGAGGACAGCGC 

GTCGACGTCGACGAGGTTCCGCAGGCGGACCGCCTGCACCACGTTGTAGG 

CGGTCCCGCCGGGGAGCAGCTGCTCGAGGACCCACAGGCGCTCTTGCTCG 

TACGAGAGCGGATACGGCTCGTCCGCCGGCGCGCGGCCCAGCGAGGGCGC 

GATCTCGCTGGCGGGCACCGTCGCTGCGGCGGCGGTGGTCGAGGCGGCGC 

CGGAGGAGAGGCGATCGGCGAGCTGGTGGAGGGTTGGGTGCTCGAAGAGC 

GTGCGGAGGGTGGTGCGGATGCCGAGGGAGGACTCGATGCGTCCGAGGAC 

CTGCATGGCGAGCAGGGAGTGGCCGCCGAGGTCGAAGAAGCTGTCGTGTC 

GTCCGACGCGGTCGAGGTGGAGGACGGATTGCCAGATGTGCGCGAGCTCC 

CGCTCGAGCTCGCCCGAAGGGGGCTCGTAGTCGGCGTGCGCGGCGGGTGG 

CGCAGGGAGGAGCTTCTTGTCGACCTTGCCCGAGAGGGACATGGGCAAGG 

CGGGGAGCAGGACGAAGTGGGCGGGCACCAAGGCGTCGGGCACCAGGCGG 

GCCATGCCCTCGCGCAGGTCGCGCTCGGAGGGCGGGTCGGCGCCCGGCAC 

GACATAGGCAATCAGGCGCGCGGCGCTGCCTTGGCCGTGGAGGACGACGA 

CGCCCTCGCGGACGGCGGGCAAGCGTCGCAGGGCGGATTCGACCTCGCCG 

AGCTCGACACGGCGACCGCGGAGCTTGACCTGCTCGTCGCGGCGTCCGGC 

GAAGGCGAGCTGTCCGTCGGGGCGCCAGCGCACCAGGTCGCCGGTGCGGT 

AGAGGCGTGCGCCGGGCTGGCCGAAGGGATCGGGCAGGAAGCGCTCTGCG 

GTCAGGTCCGTGCGTGTGTAGCCCTGGGCGAGGCACGCTCCGCCGATGTA 

CAGCTCGCCGAGGACGCCGGGCGGGACGGGCTGCATGTGCGGGTCGAGGA 

CGTAGACGAGGGCGCTGTCGATGGGTCGGCCGAGCGGGGGCTCGTCGCCG 

AGGTCGGCGACCTCGGCGACGGTGGTGATGACGGTGGCCTCGGTGGGGCC 

GTACATGTTGAAGAGGCGGAAAGGGAGCGGTCGCCGGAGCGGATGGAGCT 

TGTCGCCGCCGACGGTCATCGCGCGCAGGGCGATGCCGGTCCAGTCTTGC 

TCGAAGCACGCCTCGGCCAGGGGCGTGGCCATGAACGAGAGCGTGGCCCG 

CTGAGCGACAAGCCAGGAGACGAGCGCTGTGGGAGAGCGGAGCGCGTCGT 

CGTCGGCGAGGAGGAGTGCAGCGCCGCAGGCGAGCGGCGTCCAGATCTCG 
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TAGACGGAGGCGTCGAAGCCGCTGGAGGCCAGCTGAGTCCAGCGATCGCG 

GGGTGAGAGCGCGAGCAGGTGCTGGAAGAAGGAGACGAGCCTTGAAAGGC 

TCGCATGGCGCACACAGACGCCCTTCGGCGTGCCGGAAGAGCCGGAGGTG 

AAGAGGACATAGGCCAGGTCGTCGGGCCTGGAGACGAGAGGAATGTGGGT 

GCTGGGCGCGCACGCCCCGTCCTGGACGAGGTGGACGGGGCAGGGGGCGG 

CGGTGAGCTTGTGGCTGGCCTGGCTGCTGGTGAGCACGAGCGCGGCGCGG 

CAGTCGGCGAGCATCTCGGCCAGGCGCGCCGGGGGGTTGGCGGGGTCGAG 

CGAGGCATAGGCGGCGCCTGCCTTGAGGACGGCGAGCTGGGCGGCGACCA 

TGCGGGGCGAGCGCTCGATGCAGACGCCGACGACGCTGCCGGGGCCGACG 

CCGCGGTCGCGCAGCCACAGGGCGAGCTCGGTGGACCAGGTGCTGAGCTC 

TGCGTAGGTGAAGCGCTGGTGTCCGAACTCGAGCGCCGTGGCGTCCGGCT 

GTCGAGCGGCGTGGGCCTCGAAGAGCGCATGGACGCAGGCGGGGGCCGGG 

GCGGAGGCGGCCTGTCGTGCGGCGGCAGCGCCGCTCCAGTCGTCGAGGAG 

CAATGCGCGCTCGGCGTCGGAGAGCATCCGGAGCTCGGAGAGCGGTCGAC 

CGGGGTGCTCGACGGCGCTTTCGAGCAGGAGCACGAAGTGGCGCGCCATC 

CGCTCGATGGTGGCGGGGTCGAAGAGCTGCTGGTCGTACTCGAAGCGCAG 

GGCGATGCCGGAGTCGAGCTCTGCGGCGAACAAGGCGAGATCGAACTCGG 

CCGCTGCCTGCTCGTCGGCGAGCGTGGTGAGCTCGAGCTCTCCCTGCGCG 

ATCCGCACGTCCTCCGCGCCGGTGGTGAGCGCGGCGAGGCGGGGATCCAG 

CGACGGCAGAGCGCCCTGGAAGGCGAAGGCGACGTCGAAGAGCGCGCCGC 

CTCGCCGGGCCGCGCCCCGGGGCTCTGCGAGCAGGTGCTGGAGGGCGCTG 

TCGCCGTGGGCCAGCCCGTCGAGGAACGCGTCGCGCACGCGGGCGACGAG 

CGCGTCGAAGGACGCGGCCCCGCGCAGCGCCACGCGCACGGGGAGCATCT 

GGACGAAATAGCCGAAAGCCCGAGTGCTCTCGTCGTGGTTCCGCCCCGCC 

GAGGGGACGCCCACGACAAGGTCGTTCTGCCCGCTCGCGCGATGGAGCAA 

GACGGTGAGCGCCGACAGCAGGACCGAGAAGAGCGTGGTCCCGCGCTCGC 

GCGCGAGGCGCGCCAGCGCTCCGGTCAGGGGCTTTGGCAGCGTGATCGCG 

TGAGCGCGACCGCGGCGAGGGCTCGCGTCGTGGCGGGCCCGGTCGCGGGG 

AAGGTCGATGGCGGTCGTCGCGCCGTCGAGCGCCTTGCGCCAGTATTCTG 

CTCCGCCGGCCGCCTCCCGCGGCGAGGGACAGCTCACGCCGGCGGCGAAG 
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AAGCTCGACGGCGGCGGCAGCTGCGGGGGCCGGCCCGCGCGCAGCGCCGA 
GTACAGCTCCCCCAGCTCGCGAACGAGCAGCGCGAACGACCAGTAGTCGA 
CCACGAGGTGGTGAACGACCACCGTGAGCAGCGGCGGCTGCCCCTCTCCG 
CGCCGCCAGACATGCACCCGGAGCAGCGGTCCGCGCTCCAGGTCGAACGC 
5 GCGGCGGCGCACCTCGTCCGCGCGGGCGACGATCTCGCGCTCGTCCAGCG 
^5 CCATCGCCGGCTCTTCGGCCCATTCCAGGGCGACATGGCGGTGGACCTGC 
TGCAGCGGATGGCCGTCGCGCGTGAGGAACGTCGTGCGGAGCGCCTCGTG 
.CCGCTCGACGAGGCCCTCGAACGCGCGGCGCAGCGCGGCCACGTCGACGC 
CGGCACCGAGCCGGACCGTCCTGCCCAGGTTGTAGAGCGCGCCGTCGGCC 

20 

10 GACTTCTGGCACTCCAGCCACATCGCCCGCTGCCCCTCGGTCAGCGCAAA 
CGGCTCTTCCGGCGTCGCCACCCGCGGCGACGAGGGGCCGTCCGGACGGG 
TGAGCCCTCGCTCCAGGGCCGTCGCTGCGGCGGCGGAGGTCGAGGCGGCG 
25 CCGGAGGAGAGATGACTGGCGAGCTGCGCGAGGGTTGGGTGCTCGAAGAG 
CGTGCGGAGGGTGGTGCGGATGCCGAGGGAGGACTCGATGCGTCCGAGGA 
1 5 CCTGCATGGCGAGCAGGGAGTGGC CG CCG AGGTCGAAGAAGCTGTCGTGT 
CGTCCGACGCGGTCGAGGTGGAGGACGGATTGCCAGATGTGGGCGAGCTC 
2^ GAGCTCGAGCTCGCCCGAGGGGGGCTCGTAGTCGGCGTGCGCGGCGGGGG 
GCGCAGGGAGGAGCTTCTTGTCGACCTTGCCCGAGAGGGACATGGGCAAG 
GCGGGGAGCAGGACGAAGTGGGCGGGCACCAGGGCGTCGGGCACCAGGCG 
2 0 GGCCATGCCTTCGCGCAGGTCGCGCTCGGAGGGCGGGTGGGCGTCTGGCA 

35 

CGACATGGGCAATCAGGTGCGCGGCGCTGCCTTGGCCGTGGAGGACGACG 
ATGCCCTCGCGGACGCCGGGCAAGCGTCGCAGGACGGATTCGACCTCGCC 
GAGCTCGACGCGGCGACCGCGGAGCTTGACCTGCTCGTCGCGGCGCCCCG 
40 CGAAGGCGAGCTGTCCGTCGGGGCGCCAGCGCACCAGGTCGCCGGTGCGG 

2 5 TAGAGGCGTGCGCCGGGCTGGCCGAAGGGATCGGGCAGGAAGCGCTCTGC 

GGTCAGGTCCGTGCGTGTGTAGCCCTGGGCGAGGCACGCTCCGCCGATGT 
ACAGCTCGCCGAGGGCGCCGGGCGGGACGGGCTGCATGTGCGGGTCGAGG 
ACGTAGACGAGGGCGCTGTCGACGGGTCGGCCGAGCGGGGGCTCGGCGCC 
GAGGTCG3CGATCTCGGCGACCGTGGTGATGACGGTGGCCTCGGTGGGCC 

3 0 CGTACA7GTTGAAGAGGCGGAAAGGGAGCGGGCGCCGGAGCGGATGGAGC 
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TTGTCGCCGCCGACGGTCATCGCGCGCAGGGCGGAGCCGGTCCAGTCTTG 

CTCGAAGCACGCCTCGGCCAGGGGCGTGGGCATGAATGAGAGTGTGGCCC 

GCTGAGCGACAAGCCATGAGACGAGCGCCGTGGGAGAGCGGAGCGCGTCG 

TCGTCGGCGAGGAGGAGGGCAGCGCCGCAGGCGAGCGGCGTCCAGATCTC 

GTAGACGGAGGCGTCGAAGCCGCTGGAGGCAACCTGAGTCCAGCGGTCGC 

TGGGCGAGAGATCGAGTCGGAGGTGGAGGAAGGAGACGAGCCTTGAAAGG 

CTCGCATGGCGCACACAGACGCCCTTGGGGGTGCCGGTGGAGCCGGAGGT 

GAAGAGGACATAGGCCAGGTCGTCGGGCCTGGAGACGAGAGGAATGTGGG 

TGCTGGGCGCGCACGCCCCGTCCTGGACGAGGTGGACGGGGCAGGGGGCG 

GCGGTGAGCTTGTGGCTGGCCTGGCTGCTGGTGAGCGCGAGCGAGGCGCG 

GCAGTCGGCGAGCATCTCGGCCAGGCGTGCCGGGGGGTTGGCGGGGTCGA 

GC3AGGCATAGGCGGCGCCTGCCTTGAGGACGGCGAGCTGGGCGGCGACC 

ATGCGGGGCGAGCGCTCGATGCAGACGCCGACGACGCTGCCGGGGCCGAC 

GCCGCGGTCGCGCAGCCACAGGGCGAGCTCGGTGGACCAGGTGCTGAGCT 

GTGCGTAGGTGAAGCGCTGGTGGCCGAACTCGAGCGCGGTGGCGTCCGGC 

TGTCGAGCGGCGTGGGCCTCGAACAGCGCGTGGACGCAGGCGGGGGCCGG 

GGCGGAGGCGGCCTGTCGTGCGGCGGCAGCGCCGCTCCAGTCGTCGAGGA 

GCAATGCGCGCTCGGCGTCGGAGAGCATCCGGAGCTCGGAGAGCGGTCGA 

CCGGGGTGCTCGACGGCGCTTTCGAGCAGGACCACGAAGTGGCGCGCCAT 

CCGCTCGATGGTGGCGGGGTCGAAGAGCTGCTGGTCGTACTCGAAGCGCA 

GGGCGATGCCGGCGTCGAGCTCTGCGGCGAACAAGGCGAGATCGAACTCG 

GCCGCTGCCTGCTCGTCGGCGAGCGTGGTGAGCTCGAGCTCTCCCTGCGC 

GATCCGCACGTCCCCCACGCCGATCGCGAGGGCTGACAGGCGTGCATCCA 

GCGATGGCGGGGTGCTCTGGAAGGCGAAGGCGACGTCGAACAGCGCGTCT 

CGCTGCGCCTCGCCCTGCGCTCGCGCGAGCAGGTGCCGGAGGGCGCTGTC 

GCCGTGGGCCAGCGCGTCGAGGAACGCATCTCGCACGCGGGCGACGAGCG 

CGTCGAAGGACGCGGCCCCGCGCAGCGCCACGCGCACGGGGAGCATCTGG 

ACGAAGTAGCCAAAGGCCCTGGCGCTCTCGTCGTCGTGCCGCCCCGCCGA 

GGGGACGCCCACGACAAGGTCGCTCTGTCCGCTGGCGCGATGGAGCAAGA 

CGGTGAGCGCCGACAGCAGGACCGAGAAGAGCGTGGTCCCGCGCTCGCGC 
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GCGAGGCGCGCCAGCGCTCCGGTCAGGGGCTTTGGCAGCGTGATCGCGTG 
AGCGCGACCGCGGCGAGCGCCCGCGTCGTGGCGAGCCCGGTCGCGGGGGA 
GGTCGATGGCGGTCGTCGCGCCGTCGAGCGCCTTGCGCCAGTATTCTGCT 
CCGCCGGCCGCCTCCCGCGGCGAGGGACAGCTCACGCCGGCGGCGAAGAA 
GCTCGACGGCGGCGGCAGCTGCGGGGGCCGGCCCGCGCGCAGCGCCGAGT 
ACAGCTCCCCCAGCTCGCGAACGATCAGTGCGAACGACCAGTATTCGACC 
ACTACGTGGTGATCCACCACCGTCAGCACTCTGCGTTTAGTCCTTTCTCC 
TGTTCGGCCTATTAATTGCTACTATGGATCCACACTGCTCCGCCTCTTGT 
ATCTCCCTTATCTGCACTTGCTGCGCTTNACCTTTACGTTCCTCTCCCTG 
CTCTATACTATTTCTTCCCCCGCTTCTCGTCCTATTCTGCATTTGTCATA 
TCGTATCTTCATATACCTTTCTTTCGCTATCCTTACTGCTTCTCGACCTT 
ATGTGCGTCTGTCTTCCCTTTCTNTATTATTTCTCTGTCTCACCGCTCTN 
TGCTCTGTCGCTCCTATCACTAAATTATGTCTCTATCACTGCTACTATCT 
GAAGCTGATCTTCGAGATCTCGCTNGGTGTCACTCTTTATCTCATAGNCG 
CCTCTGTCTTCTTGTCTCCTTAAGNCTGATTTTCTCGCTCTATTCGTGAC 
TACTCTGCTGTCTCTCACATACGTGTTCTTGAATCGTATTCGCGTTCTCG 
CTACTGTGATATCCATTGCCGACCTCTACTGCTCNTCTNTATGCTATACT 
TCTTAGTCTCTTACTACGTTNGTCTGATATNTTGCTGACGACGTCATGTC 
ACGCTCGCAACTCTTCANTTCTATCGTATACGCTGATCATCATTTTCTGT 
GAGGCTGATGTACTATACGTAATTACCTGTATACGTCGTCTATCTACTCT 
CGTGTCTTCACTCTTTCTACTCC 

Sec ID No 91 (>Contigl2) 

CCCCCCGCCGTCCGCCGGTACGTCGCGGACCGCCGCCCCGAGCAGCTCCC 
CGCGCTCGCGCCGGAGGAGCGGGAGGCCGCGGCGCGCCGCCTGTCGGCCC 
TCGGCGCGGCGCCGCCGCAGGTCCGGCGCCGCGGGCTGACGCGGGCGCCG 
CTCTCGTACGGGCAGAGCCGCATCTACTTCCTCGAGCAGCTCTCGCCCGG 
CAAGCCGCTCTTCAACGTCCCGGGCGCGGTCCGGCTCCGGGGCCCGGTCG 
ACGTCGCCCGCCTCTCGGCAGCGTTCGGCGAGATCGTGCGGCGCCACGAC 
GCCCTCCGCACGTCGATCGCCAACGTCGACGGCGAGCTCCTGCAGATCGC 
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GCAGCCGCACGCGGGCTTCGCGCTCGACGTGGTGACC7CGACGCCCGAGG 
AGGCGGCCGAGCTCGACCGGCGGCTGCGCGCCGAGGCGTGGCGGCCCTTC 
GCGATCGGCGCGCCGCCGCTCCTGCGCGCCACGCTGTTCCGCCTCGCGGA 
GGACGAGCACGTGCTCCTCGTCACGATGCACCACGTGGTGTCGGACGACT 
5 GGTCGCTCGGCGTGATCCTCCGCGAGCTCCTCGCGCTGTACGCGC-GCCGC 
15 "T^GCTCCCGCCGCCGCGGCTCCAGGTCAGCGACTTCGCGGCGTGGCAGCG 
CGAGATGGTCGAGTCGGGGGCGCTCGACGGCCAGCGCGCGTACTGGCGAG 
AGCGCCTCCGGGGGCTGTCCCGGGCGAGCATCTCGGCCGGCGGCGGGGCG 
GAGGCGCCGAGCCACGACCCGTCCGGCGCCATCGAGGAGATCGCGCTCTC 
10 GCCGGACAAGGCGGCGGCGCTCGAGGCGCTCGCGCGGCGGGAGGGAGCGA 
CCCTGTTCATGGTGCTCCTCGCGCTCCTCGACCTCGTGATCCATGCGCGG 
TCCGGCGCACTGC-ACATCGCCGTG3GGACGCCCATCGCCAACCGGAACCG 
CCCGGAGCTCGAGGACGTGGTCGGCCTCTTGACGAACACGCTCGTGATCC 
GCGTCGATCTCGCGCGCGCCGGGGCGTTCCGCGACGTGCTCGCGCGGGCG 
CGCGTCCAGGCGCTCGACGCCTTCGCGAACCAGGACATCCCGTTCGATGT 
CGTCACCCAGGATCTGAAGCAGGAGCGCGACCACGCGCAGCACCCGCTCT 
30 TCCGCGTCTGGCTGGCGCTCCAGAACGCGCCGAAGCCCGCGCTGGAGGTC 
CGCGGGCTCCGGGTCGAGCCCCTGCCCCTCCGGCCCGAGCTCGTGCACTT 
CGAGGTCGCCCTCCTGCTCTGGCCGGCGGACGACGGATCGGTCGTGGGGC 
2 Q ACTTCGAGTTCCGGCGCGATCGCGTCGACGAGGGCGCGCGCAAGGAGATC 
GCGGCCGCATTCACGCACCTCGTCGACGCGGTGATCGCCCGGCCGGACGC 
GCCGGTGTCGACGCTCGTGGAGGGCGCCCGCGCCGAGGCCGCGCGAGCGC 
AGGCCGCGCTCGGCGAGGCGTTCGCCAGGGCGGCGACGGCGCGCCTCGGC 
^ CAGCTGCGGCGTCGCTCGGCGGGCGACCGGACGCCCCGCGAGTAGCGGTC 

2 5 AGCCCTCGGCGGCGGCCAGGCGCACGCGGAACGGCGCAGGGTAGCCGTGG 

ACGCGCGGCATGGGGTCGATCGCGCTGGGGACGCCGGCCCGCAGCAGCTG 
CTTGATGGCGAGCGAGATGTGCAGGATGGCCACGTACTTGCCGTGGCACG 
TATGGATCCCTGACTCCCAGAACA3GTAGTTTCTGCTCGGCAGGCGCCCG 
GGCCTGAACTGGTCCGGGGCGTCGATGTGCTCGTGGTCGTGCATCGCCGA 

3 0 GGCGCTGCAGGCCATCACCAGCGCGCCGGCGGGGACCTTCTCCTCGTGCC 
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GCGTGCCACGCCCGACCGTGTAGTCGCGCACGCAGAGGCTCGTGACGCCG 

GTCGACGGGGGACGGAAGCGCAGCGCCTCCAGCACATAGCCGGTGATGGC 

GGCGTCGTCCTCGACGTTCACCACGTTGAGCGCGTCGCGCAGGACGCGCG 

GGCGCTTCATCAGCTCGACCAGGGCGTTGACGATCGCGCCGCCGCTGAGA 

TCCACGCAGCCCATGAGCAGCCCCAGGATCACGTCGCGGATCCCCTCGTC 

GCTCTCGTAGGTCTCGGGGACCGACTGCATGAGCAGGTAGCGGTCCAGCA 

CCGAGGGTTGCTCTGGCGGGGGCGACTTGGCCAGCTGCTTCTTCCGCGCG 

GCGACGATCGCGTCGATCATCGGCAGCGCCTCCTGACGAGCGGCCCTCGC 

CGCCGCCACGGCCGTCGGGTCGTTGGTCGGGTTGAGGAAGATCTCGTTGA 

ACAGCGCGTGGGTCCACGCCACCACCTTCTCGGTCGGGATCTCGCCGACG 

CCGAGGTACCGGGCCATCGCGCCGGCCGGCACCCTGAGCGCGTAGTCACC 

GGTGAGATCGAACGGCTTGTCGACGCCGACCTTGGCGAGCAGCCGGTTCG 

CCTCGTCCACGACGATCTGACGGTAGCGGGGCAGATCGGCGCGCGGGAAC 

GCGAGGCGCAGGAGCGACTTCTCGTGCTCGTACTTGGGCGAGTCGTTCAT 

CGCCAGGATGTTCTGGCCCACGTTCTCGACCAGCTTGGGCGCGATGTTGT 

CGACCGAGAAGACGTCGTTGGCGTTGAGGACCTCGACGACGTCGTTGTAC 

CGGGTCACGAGCGTGATGGCCGGGATGGAGAAGATGGGCTTCTCGCGCCG 

CAGCTGGCTGAGGAACGGGAGCGGCTCCTCCCTCAGCCACTTGAACACCA 

TGCCGGCCTCGATCTGCTTCCGCTTGACCGGATCGTTCTCGTGCGCCAGC 

GCGCTGTGGAGGGCCTGCAGGTAATCGAACGGCGGCGCCTTGGCAGCGTC 

CGCTCGTCCCTCTTCTTCGATGTGAATGCTCATGGGGAGAATTCCTTTCT 

CGCATGCCGATCAGATCGCGACGCTCTGGGGGACCATCGACGGGAGCAGG 

TACAGGTACGGCTGCTCGCGGGCGCGGTTGCGCTCGGTGATCTCCCGCTC 

GATGTGCGCCAGGCGCTCGCGGTAGCGAGCAAACGCCTGCTTCGCCGCCG 

GATCTTGCAACAGGCACTTCATCGTCATCAGGCTCTCGCTGGTCAGCGGC 

GGGCCCATGCTGAGCGAGCGGCTGAGCACCATCTGGCGGATGCTCTGCGC 

GCGCCCCGGGAGGCGCTCCAGCGGCTTGAACTGCCGCTTCTCGCTTCCGT 

TCAGGACATCGCCGTAGGAGCGGTAGGTGGCAAACTGAGCGTTCGGGATC 

CAGGTGTAATAGTCCGTCTGCCCGAAGTTCACGGCGGCGTGATATGCGGT 

CGCCGTGAAGATGATGTTGGTGACGATCGCGATCAGGTCGTCGAGGCTCG 
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TGAGCTTCTCGAGCTGATCGGCTCGCTCCGGCGGGAGGAGGCTATCCATG 
CCGCCGAGCTGGGGGGACACGAGCTCGTGGATCCACCGCTGCAGGCTGGC 
GTCGCTCGACAGAGACCCCGGCGTCGGGTAGGCGATCTTCAGCACCTGTC 
CGACGTACTCCTGGATCGCGTCCCAGTGCAGCAGCGCGTCGTCGCGGTAG 
5 TGATAGCCGACCAGGTCGCGGACGTCGCGCGCCGACAGGTCGCGGGGGAG 
15 CGCGCTCTCGTAGAACCGCCACGGCTTGCCGCCGTACCCTTTGATGCCCT 
TGCCGGTGTAGGCGCGCGTCAAGAGCTCGAACGAGCCCATGGTGGCCACC 
GAGCTCGTGATGTCGAAGAAGCGCCCTCGCCCGAGGAAGCGCCGGCGAGC 
CAGCTCGTTGATGGCCAGGGTGTTGAAGAAATGCGGCCTGAGCAGCTGGT 
GGAGCGGATGCGTCGCGGGCAGGTTGCGGTAGGTGCTCACCGCGAACGGC 
TCCACGATCAGGTGCGCGTACAGCAGGTGGGTCACCTGGCCCTGGTAGAT 
GGCGTCGGCGCTCGCGACGGCGATCTTCGCCGTGAGCCAGTCGTCCGACG 
25 GACCCGAAGGGGTGAAGATCTTGTCGGGATGCGCCCCTTTCCCGGGGCGC 
GAGTGCACCAGCCTGATGGCCACGGGCAAGAGCTCACCGGCCGCGGTCTG 
1 5 GTGCAGCATGCACGTCGGCGCCAGCGGGTACTTGCCCAGCTCTTCCTGCA 
CGTCGGTGTCGACGATGTCCTTGAAGATGCGGTAGTCGAGGAAGTAGAGC 
30 TGCCCGCCCTCGCGCACCTCCTCCAGCGTGCGACCGTCGGCGATCGCGAT 
CGGCTTGGGCTCGGCGCCGCTCACGAAATCGGCGAGATCGGCCGGGGTCG 
CGCGGCGGATGTGCGCCGGGTTGATCCCCACGAGGCGCTGCCGCCCGAAC 
2 0 TCGGCGTCCTCGGCCCAGCGCGTCGCCACGAGGGGCTTGCGGATGAAGGT 
CCACGGCTTGAAGAACTCCTCGAACTGATCGAAGCTCTCCCAGTTGTCGA 
TGGACTCGAAGATGGCGCCCAGCCCGAGGTCGGACGTGGCCCTGAGGACG 
AACTTCCCCTCGCGATAGCGCTTGTACCCGTACTCGAACAGGTGAAGCGC 
^ CTGCGCGATTTGCAGTCCGGCAGTGTCCTTCCACTTGCCGAGGTTGAGCG 
2 5 CCAAGATCTTCTTGATCGGAAAGCCTTCCCCCGGCGGTACGGAATCGCTG 
CCCGCCGGCAGGTTGGACCCGAAATTGCGCCAGTTCGGTGTCGAGCTGGG 
CTCCCTCATGCTCGTGCTCGCTTCTCCGTCTCAGACGGACGGTGGATTGG 
GTGGTTCACGTCAAACATCGCTCTCGCGTCGCAGCGGTCCGAGCGCGCGC 
CGGAATGGTTCCGTCTCAGTCGCAACAGGACTCAGTACATCCAGCGCCGC 
30 CCCCCGTCCTCGACCTGCCCCCGCAGCCGATCGCGCCGCCCTTCATCGTG 
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GAATCGACAGGTGCGATTCCACGAAAAGCCGCCGCGCCGAGTTGCACGCG 
ACCGATGCTCACGCGTGCATTGTTGAGGCTGCTAGAAAACCGTGGAGCGT 
TCACGCATGTCAAGCCATTTTGTTCGGCGCCGCGGCGAGCGGCCGGATGC 
CGCGCGCCCCCGCGCCGGGCGTGTTCGCTCCCGACGTACCGCTACCTCGA 
CGACGTATGGCTTGAAGGGCAACCGCGCAAGTCGTCCGATTCGTGCTCGT 
ATCCTGCTCCTTCCAGCAGGATTTCCCCGCCGCCAGCGGCACAAAGGTGC 
CAGGGCGAGCAGAAAGAGCGCTGCCCGCCCCTCCCCGGCCGCGCTCGCCT 
CGATCAGCGCGCGCTCGTCGTCGGTGCCTGAGACGTTCGACGGACGTCAG 
-TTAGTTAGCCTAGCTAACTTCAACACTGATGCGACTGATCGGGCCGACGC 
AACCGACGCAACCGACGCAACCGACGCAACCGACGCGACCGACGCAATCG 
ACGCAACCGACGTGACGGACGCTGGCGACTCGAAGAAAACCACGGACGCA 
CTCCACGTCATCGACGTCATCGACGTCATCGATGCGCTCGATGCAATCCA 
TGCACTTGACGCGATCGGTGCGAGCAGGCGACGAGGTCCTCTCGTGAAAC 
ACCGAACCGAGTGCCGGTAGCGGGCGCGCCGCAGTGTATGCTAGGCTCGG 
CCCTCTTGTCGAGGCCGCGCGCTCGGCGGTCGAGCGTGGGCTCGGGTGCC 
GCGGTATCCGGCTGAACCAAGGAGGAGCGAGCCATGCAGGCAGATGACGA 
CGCGACGATCTACAAGGTGGTGGTGAACCACGAGGAGCAATACTCCATCT 
GGCCGGCGGACCGAGAGAACCCGCTCGGCTGGACGGAGGCCGGCAAGACG 
GGCAACAAGGCGGAGTGTCTGGCGTACATCCAGGAGGTCTGGACGGACAT 
GCGCCCGCTCAGCCTCCGGAAGAAGATGGCCGAGAGCCCCTGAATCGCGG 
CCCGCCCGAGCGCCCGTCGCGAGCGGCCGGGCGGCGGGCTCAGCCGTGTC 
ATCGTCGCGCTCGACCGGCCGCGTCCCGCGGGATCGCGCGAGCCCGGCGG 
GGTCGTGCGCGCCGGCGCTTGTGCCGGGGCCCCCGCTCTCGTACGCCTCC 
GTCATGCCGCCCCTCGATCTGCACGTCGCCTTGTTCGGCGCCTCCGGCGC 
CGGCAAGACGGTCCTCCTGGCAGCCTTCTACCGGGCGCAGACCCAGCCCT 
CGTTCCAGCAGGAGTACGCGTACAAGATCCAGGCGGTCAACAAGGCGCAG 
GGCAACCAGCTCCTCGGCCGGTTCTATCGCCTCGAAGAGGGCAGATTCCC 
GGACGGCAGCACGCGCTTCGACGAGTACGAGTTCGACTTCTTCCCGAGAG 
ATCTGCCCGAGCCGGCGGTCCGCATCCACTGGTACGACTACCCGGGACGC 
TGGTGGGAGGACGAGCCGGTCGACGCGGACGAGCGGGAGGCGATGCGCCA 
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GGGCCTCATCCGGCTCGGGATGAGCCAGGTGGGCATCCTCCTCGCGGACG 
GCGCGAAGTACCGGGCCGAGGGCACCGGGTACATCCGGTGGCTGTTCGAG 
CACTTCGCCGACGAGTGCGACCGGCTGCGCCGGGCCAGCGCCGCCACGGG 
CGACGAGGTGAGCTTCCCGCGGGAGTGGATCCTCGCCCTCAGCAAGGCCG 
5 ATCTCTGCCCGCCGGACTACAGCGCGCGGGACTTCGAGCGCGAGGTCTGC 
15 CGGGACGCCGACGATCAGCTGGCGAAGCTCTGCTCGGTGCTCCGCGCCGA 
GCACGCGTTCGGCCACCGCTTCATGCTGCTCTCGTCGGTCGCCGCCCCGG 
CCGGCGCGCAGGTCGATCCGAGGACCTCGCTCGGCGTGCGCACCCTCGCC 
CCCGCGATCCTGGTGAGCACGGTCGAGGGCGCGGTGCGCGAGGCGCAGGC 

20 

10 GGCGAGAAAGGAGAAGTCGGCCGGAGAGACGTTCTTCCAGGGGCTGCGCG 
ATCTCGTGCAGTTCGTCGACTCCCTCGACGACTTCCTGCCGAAGCGATAC 
CAGATCGTGAGCAAGATCCTGCGGTTCATCTCGATCAAGGACTTCGCGAC 
25 CACCCGGCTCGACCGGCTCAAGAAGATGCGCGAGGACGCGATCCGGAAGG 
GCGACACCTTCACGGCGGTCCTGACCGCGATGGTCGCGGCCCTGCGCGAC 
15 GACGAGGGCGCCCGCGCCTACCACCAGAACCAGTGAGGTCGTCATGCCCG 
CGCCAGCGCCCCTCGTCGAGACATCGCGCCTCCTCTGGAGGACGCGCGGC 
30 GAGCACTGGGATTACGAGTTCATCTGTGTCCCCGAGATCCCGGCGCTGCC 
CGCCTGGCTCTCGACGCTCGAGGCGATGCTCGCCGACGCCGACGCCGGCG 
CCGGGGAGCTGCGCTATGGCCTGCTCGAGATCGACGATCGCGGGCAGAGG 
2 0 GCGCCGCGCGCCTATCCCTACGTG3CCGTGAGGTTCCTCGATCCGGCGCG 

35 

GAGGGACTGGACCGGACGGCAGGTCCAGCACTTCGCGGCCTGGTTCCCGC 
CGGTCCCGCCCGAGGCGGTCGCGGAGTTGCCAGAAGCGGTCCCCGCCGAC 
TGGCACCTTCGCGTGCTCGACGGGCTCGCGGGGACGTACGGCTCCGGCGA 
GGTGTTCGGGCTCCCCGAGGCGAC3ATCCGCGCCTGGAAGCGGAGCCACG 

25 ACGAGAGCCGGGCCGCGCGCGCGATGGCGATCGTCAAGGCGACGCCGCCG 
GTTTCGCTGGGCGGCGGCGAGGCG3CGCCGTCGCGGTGGACGCGGGTGCC 
GACATTAAAAAAAAAGCCGCCGGAGCCGCCGGCCGCGGCGGGCCTCCTCT 
CGGTGGGCGCGGTCCCTAGCGGCCAGGGCCGGCGATTCGGCTGCTTCGCG 
ATCGGCGCCATGATGCTCGCCGCCTTCTGTCGACTGATGCTCGCTTGCGG 

3C TGTGCGCCTCCTCGGCGCCTGACG3CTGCGCCGCGCAGGCCATCCGACGG 
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GGGGTCGGCCCGGCCAGCGCCCGCCGGGCGACACCAGGGCATCGGCCCTC 
CGCTCGGGGCATCGATTGAGCTCTCCGAGCGGCGGTCCGTCGTCAATCGC 
CGCAGAGCTCCCACCC-GGCGGAGCAGCTCTGGCCGGTGACCGCATAGGGG 
TTCGTCGGGCAGGTCCACCACTCGCCCTGGAAAGGACGCGGGTTGCAGTG 
5 CGGGAGGCACTCCACCCACCCCGACGAGCACGAGTTCCCTACCGAGACGG 
15 TCGGCTGAGCCGCGCAGAACCAGCGTTTTCCTGCGAACCAGCCGGGATTG 
CACACATCGGGCGCCCCGCCGACGGGCGGATACACCGTGGCGACCGCCTG 
GATGTCCACGGCGTCGAGGGTCTCGCTCCCGAAGCGCACGCCGTCCTGAT 
CCGAACACCCACTCCAGTAGGTCATGATGGAGTCGTAGTCGTAGAAACCA 

20 

10 GGGTTCACGACGATGTACCGCCGGCTCGAGGGCCACCCGCTGGCGACGTC 
GCTCGCGGGGAGCGGCTCTCGTTGGCTGCAGGCGCTCGGGACCAACGGGT 
GATGCCACTCATGCATGAAGCCGATCGCATGACCCATCTCGTGGATCGCG 
25 TACTGCTCCACGCAGTCGAAGCTGTATTCGACCCGGGCTGTCTGCCAGTT 
GTACTTGATGCAACGGTTGAAGTCGGCGCCCCAGGGCTTGAACTGGACCG 
1 5 AGCCGCCCTTGTTGTAAACGCCGATCGAGTCCGATTGGTTGGGCGCGTCG 
GGGTGGATCCTGACGCCGACGTAGGTCATGCGAGTGGCCGGCAGGAGCGA 
30 ATCGCAGCTCTCCCAGCCGGTGAAGCGAACCGAGCTCCAGCGTTCCCAGC 
TGCCCTGGAGCGCGGTGCGCACGCGCGTGATGACGTCCGCGAGCGAGGGG 
TTGGGCGCATGGATCAGCCCGCCCGCGGCGCCGTCGACCCTCTGCTCCGC 
2 0 CGAGCTCGTGGGGTCGATGCAGACCGGGATCCGGACATGGCCGTCAGCGT 

35 

CCTCAGGCCAGCGACTCGCGCTGTCGAAGACGCTCGCCTCGGCGGACCGC 
GGCGCGGCGGAGACGGTCAGCGCGGCGCCCAGCGCTGCGAGGAGCAGCGG 
ACCGAGCGAAGAGCGAAACCGCACATGTCGTTCAGGGCCCCGCGTCGTGC 

^ GGTGCACCGAGACAATCTCGAGCGGGCTCATGGACGCAAACGCGTTGCGA 
2 5 TGGCCTTGCAGCATGTTCTTCTCCAATCGACGAGGGTTGTTCTGCTGAAC 
GCGGCTCCAGCGTGGAGCTCGACGCGGTTCACCGGCTTCACGCCGGGGCC 
GTGGACGAGACCCGAGCACGGGGGAGGTCGCAGCCGCACCGGCTCGCGGC 

^5 GGCTCCACCCTGCACCTACGACGAGCCTGCCGCTCGGTTTCGCGGAAAAT 
GCCACCCCGCTGCCCAGCGGGCGAAGCGCGGACGAGGCGCTCGTCCCCAC 
30 GGTAGCGCCGGTGCCGCTGCATCCACCGCGCTCCTCCATGGGTCGCTGCC 



55 



wo 00/22139 



211 



CGCGGGTCGTCGAGGAGACGGACCCGGGGCGCGGATCCCTGGCTCGGCGT 

CGCATAGCTCGTAGGGGCGGCCTTGAGCCGGCGGTACGAGCGGCGCAGTT 

CAGCAGCCGACCACGTGGACGCGGCGCGCTCCGAGCTGCGCCGAGGAACC 

CTTCAAATATTCAGATGGAATTCACAGGGTGGCTGAGAGACGGC<3AGTAA 

GATCTCAGAGATCTCCCTGCCTACCCGCATCCCTGTTCAATTTTCCGCCC 

ACAACGCGAACGGATGAGGAAATATCAGCCCGCGATCCCGACGGCCGACA 

GCATCAAAGGCCGCTCGAATCCAGGGGATTCGAGCGGCCTCGGTCGCGCG 

GACCCCCGCCGCGAGCCGCTTTGTCACCACTTCACCACTTCAGAGCTTCG 

ATCATCTTCTCACCATAACGCGTGCCCATGATAACAACGGACGCATGATC 

GAAGTGGTACTGATCCATGACGTTGGTCCCCTGCTGGCTGACCCAGTACC 

CCATAGGCAGCATGTCGGCCGCTTGGTGCACGAGGTTGTTATGACCGCCG 

CAGCACCCGCCCGCGGGGAGCTCTCCGAGAATGAAGGGAACGTCGTAGTC 

GACCCCCCAGGCTGCTTTCACCTCGTTATAGAGCTGAACGACCTTGCCGG 

GCCACGAGCTCTGGCCGTTGTCGGACTCACCCTGGTGGAAGATGATGCCC 

GCGAAGCGCGCGTTCTCGGCCGTCTTCGCTTTGGCGATCTTGTTCAAGAT 

CATCTGGTGATGCGAGCCACCAGTGATGAACGTGTTGATCGACTCGCCGC 

TCTCAGCGGTAGCGACCAACCCGATCGTATCCCCCTCAGGCAGCTTTCCG 

A3CAGGGTCTTGCCGAACCAGATGCCCGGGTCGACGGAGGTCGACAGGTT 

CCATCCTTTTTCACCAGGGCAATCGCTGAGCGGCGGATTGGCCAAGTTCC 

ACTGTCCGGCCGGCTGATTGCATCCGCCGAGGACCTTGAGCCGCGCGTCA 

GAATTTTTGTCGCTGTCCTGTTTGTCTGCGACACCAGCCATATTCGACTG 

GCCCATGAGCATGAAGATGTGAAACGTCGGACTCGCGCTCGGTGCGCCGC 

CGGTGCCTGCCCCGCTGCTAACGGATCCGGTCCCTCCCGTGGCGTCACCT 

CCAGTTCCAGCGTTCGTGCTGCCTGTCGCGTCGCCCCCGGTCCCCGCGCT 

CGTGCTGCCTGCCGTGGCGCCGCCGGTCCCCGCGCTCGTGCTGCCTGCCG 

TGGCGCCGCCGGTCCCCGAGCCGGCCCCTCCGGTGTTGTCGTCCTCACCG 

GTCGCGCCGGACTCGCCACAACCGGACGCAGCGATGATGAAGAGGAATGG 

GAGGAGCAGGAACCTGGGTGTGCCTCGGGTCGTGCGGTTCATCTCGGTCA 

TGATCGTTACCTCGTCGCGCCGGGGCGCGATCTGAAGAGCATGGCGGAAT 

CGGTAGGCCGGCGTCGCGATGCCGGCGCGGCGAACCTCGCCCGCAAAGAG 
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CTCAGCGCCGGGGCCTACCTTATCGCATCTTGGGCGCTTGGCGTCCAGGA 
TTCGGCCTTAGACAGCACAAGCAGAAGACCTTTGACACTGGATTTTTTCA 
TCATCGGCGGCGCTCGTTCTTCGCTGCGCCTCAAGCGCCGACCGTTCGTT 
TCGAAGCGAAGCGGTTTCACGATCCGGGATCGCGGAAATTTGAAACGGAC 
5 GCGTCGCGCGGGCAACGCAGGGGACTCATCACGAGGCAACCGCGCTGCGT 
15 CGCGAAATTGGCCAGCCTCTCGGAGTCCCTAGTTCCGTGCGTCAGACGCG 
TCACCCACCATGTCGAGCTCGGCGCGGCGCTCTACGTGCTTGAAAGACCT 
CCGCGAGCGCCGCGCTCTGCGCTCCGCGAGCGCACCAGGCTCCCCGTGGA 
TTCAGGGCAAGGCGGTCGTGATCACGTCCTTCGTGGTCGCCGTGCTCTGG 
CCGGGGCGGTCGACGCAGGACGGGTAGTACTGGCCGGCCCCGTTGAGCTT 
CTGGCAGACCCGGTCGCACGACCCGGCGATGCGGATCAGCCCGCACTCCA 
CGATCTGACCCCCGGAGGTCACGTGGCCCGCGGCGCAGTCGCGCTGGTAG 
GCGCGCGAGTTGTCGACCGTCGCGCTGTTGTAGCAAGCGTTGATATAGGG 
TTGCGCGGCGAACAGGTTGCCCCAGAAGGCCCCCTCGACGTCCGGATAAT 
CGATGAGCTCCTGGCTGGAGGAGAGCGTCTTCAGCGGATCCCGCAGCGAG 
CGGGCGGAGAGGAGCACCGGTACTTGATAGTAGTTCACGCGCGCCGCCAC 
30 GCAGCTGGACACGATGCGCTGCCCTGCGTCGTCGAGCGGCCCGCTCGCCC 
ACGCGGGCGCGACGCCGAGCAGCCCGGGGTAGCGCTCGTCGTGCCTCTTG 
CCGTTCGAGTCCGTCCACGAAAAATCGAAGGAGGCCGTGCTGCTCAGGGC 
20 GCAGCTCGCCGCGTAACGCAAGAAATCGCGCGCCAGCGCGCCGCTCGGCC 

35 

CGGGATCCTGGATCGCGGCGAGGTTCCGCGCGCTGAGGCCGCTCAGGTTC 
AGGGCGTTCAGGTTCAGGGCGTTGAGGTTCAAGGCGTTGAGGTTCAGGGC 
GTTCGTGCTGAGCGCGTTGCCGCCCACGAGGGCCCCCTGGGATTCCCCCA 
^ CAGGCTCGCCCCACGCATCGGCGTCCACCACCTCGGCGGCGCAGCCCGAC 

2 5 AGCACCCCTGCCCAACCAAGCACGATGAATGTCCGCTCGAGAGACATGGA 
TTCCCCCGTGTTCCTGGCGCATGAeCCGACGGCGCCCTGCGCGCGGCGCG 
CGCGGGCTCCCATCGATTCGCTC'GATGGGTTCAATATTCTACTTTTTCCC 
GCGCTCTCGCGCCGGTGAAAGTCGCTTCAGCGGCGGCGAGGTCGATGTCA 
GGAGCGTCCGACTCCGTCGCTCTCGTCAGCTCCGCGTACCAGCGACGGAG 

30 TCGCCCGCCCATGACGGTCGGAATGGTAGAGGCGGCCGCGAGGGCGCGCT 
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CGAGCTGCGCCCGGGCGTCGGCGCGCCGGCCGCGGCGCAGGGCGGCGAGC 
GCCCGCGCCTCGAGCACCTCGA7CCGCTCCTGCCCGACCGAGCAGCGCGC 
GGAGCGCTCCTCGAGCGCGGCCCACGCGGCGCGGTCGTCGTCGCGGGTCG 
CGAGCTCGATCATCGCGCAGAGCACGTCCTCCGAGGGCTTCAGCGCCTCG 
5 CAGCCGGCGTCGTCTCGCGCCGCGCGGAGCCGCAGCGCGATCCGGCGAGC 
^5 GCCCGCCTCGTCGCCCTGGTAGAGGCGCAGGCGCGCGATCAGGAGCGTTA 
CGACGACCGGCGCGTGGCGATCGCCGCAGCGCGGCGCTATCGCCTGGACG 
GCGCGCGCATGGGGCCTCGCGGCCGCGAGATCGTCCATCAGGTACAGGTA 
CTCGGCGAGGTTGTAGCGGCCGACGAGCTCGAACGCGGGCTGGCCGAGCT 
10 CGCGCCCGAGCGCGATGGTGCGCTCGAAATCGGCGATCATCCCGGCGCGA 
TCGCCCTGGAGCGCCCGCGCGAGCCCGCGGTTGTTGAGCGCGGCGCCGAG 
GTGCATGAGATCGCTGCGCTCCTCGCAGCTGAGGATCACCGCGTCGAGGT 
25 CTCGCGCCGCCTCCTCGACGCGGCCGAGGCTGGCCAGGATGAAGCCGAGC 
AGCAGCAGGGCGATGATGTGCGTCrCGTGGCCCTCGTCCCCGAGCCGCGC 
15 CGCCTGCGCCGCGGCGCGCGTCAGCACCGCGGCGGCCTCGTCCTCGCGGT 
CGGCGCGGTGGAGCGAGCGGCCCACGCCGAGGAGCAGGCGGGCGCCGAGC 
30 AGGGGCGAGGCCACCCC-GCCGGCGAGGCGCTCGGCGGCCGCGACCCGCTC 
GCGCGCGGCCCGGTACTCGCCCGTCCAGTCGAGGATCATGGCCTCGTCGA 
GGAGGAGCTCGATCTCGGCCCCCGCCTCCGACGCCGCCGCCGCCGCCTCG 
20 CGCGCCGCGGCGAGGTCGGCGAGGGCCTCGGTGTGGCGCCCGAGCCGGAA 
GCGAGCGAGGCCCCGCGCTCGGCGCTCCTCGGGGAGCAGCGCGCCGAGCA 
GCGCCTCGACGCGCCCGTAGCAGCCCTCGGCGTCGAGGTAGGCCCGGCGC 
GCGGCCGCGAGCTCGGCGCCGCGGGCGAGGAGCGACGCCGCGCGGGCGGT 
^ CAGGCCGCCGCGCTCGCAGTGCGCCGCGAGCACCAGCGGATCGGCCTCGC 

2 5 CCGCGGCCTCGAGCCAGTCGGCGGCGAGGCGGTGGCCGAGCGCGCGATCG 

GCAGCTCGCCGCGTAACGCAAGAAATCGCGCGCCAGCGCGCCGCTCGGCC 
CGGGATCCTGGATCGCGGCGAGGTTCCGCGCGCTGAGGCCGCTCAGGTTC 
AGGGCGTTCAGGTTCAGGGCGTTGAGGTTCAAGGCGTTGACGTTCAGGGC 
GTTCGTGCTGAGCGCGTTGCCGCCCACGAGGGCCCCCTGGGATTCCCCCA 

3 0 CAGGCTCGCCCCACGCATCGGCGT CCACCACCTCGGCGGCGC AGCCCGAC 
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AGCACCCCTGCCCAACCAAGCACGATGAATGTCCGCTCGAGAGACATGGA 
'T'TCCCCCGTGTTCCTGGCGCATGACCCGACGGCGCCCTGCGCGCGGCGCG 
CGCGGGCTCCCATCGATTCGCTGGATGGGTTCAATATTCTACTTTTTCCC 
GCGCTCTCGCGCCGGTGAAAGTCGCTTCAGCGGCGGCGAGGTCGATGTCA 
5 GGAGCGTCCGACTCCGTCGCTCTCGTCAGCTCCGCGTACCAGCGACGGAG 
t5 '^CGCCCGCCCATGACGGTCGGAATGGTAGAGGCGGCCGCGAGGGCGCGCT 
CGAGCTGCGCCCGGGCGTCGGCGCGCCGGCCGCGGCGCAGGGCGGCGAGC 
GCCCGCGCCTCGAGCACCTCGATCCGCTCCTGCCCGACCGAGCAGCGCGC 
GGAGCGCTCCTCGAGCGCGGCCCACGCGGCGCGGTCGTCGTCGCGGGTCG 

20 

10 CGAGCTCGATCATCGCGCAGAGCACGTCCTCCGAGGGCTTCAGCGCCTCG 
CAGCCGGCGTCGTCTCGCGCCGCGCGGAGCCGCAGCGCGATCCGGCGAGC 
GCCCGCCTCGTCGCCCTGGTAGAGGCGCAGGCGCGCGATCAGGAGCGTTA 
25 CGACGACCGGCGCGTGGCGATCGCCGCAGCGCGGCGCTATCGCCTGGACG 
GCGCGCGCATGGGGCCTCGCGGCCGCGAGATCGTCCATCAGGTACAGGTA 
15 CTCGGCGAGGTTGTAGCGGCCGACGAGCTCGAACGCGGGCTGGCCGAGCT 
CGCGCCCGAGCGCGATGGTGCGCTCGAAATCGGCGATCATCCCGGCGCGA 
30 TCGCCCTGGAGCGCCCGCGCGAGCCCGCGGTTGTTGAGCGCGGCGCCGAG 
GTGCATGAGATCGCTGCGCTCCTCGCAGCTGAGGATCACCGCGTCGAGGT 
CTCGCGCCGCCTCCTCGACGCGGCCGAC-GCTGGCCAGGATGAAGCCGAGC 
20 AGCAGCAGGGCGATGATGTGCGTCTCGTGGCCCTCGTCCCCGAGCCGCGC 

35 

CGCCTGCGCCGCGGCGCGCGTCAGCACCGCGGCGGCCTCGTCCTCGCGGT 
CGGCGCGGTGGAGCGAGCGGCCCACGCCGAGGAGCAGGCGGGCGCCGAGC 
AGGGGCGAGGCCACCCGGCCGGCGAGGCGCTCGGCGGCCGCGACCCGCTC 
40 GCGCGCGGCCCGGTACTCGCCCGTCCAGTCGAGGATCATGGCCTCGTCGA 

25 GGAGGAGCTCGATCTCGGCCCCCGCCTCCGACGCCGCCGCCGCCGCCTCG 
CGCGCCGCGGCGAGGTCGGCGAGGGCCTCGGTGTGGCGCCCGAGCCGGAA 
GCGAGCGAGGCCCCGCGCTCGGCGCTCCTCGGGGAGCAGCGCGCCGAGCA 
GCGCCTCGACGCGCCCGTAGCAGCCCTCGGCGTCGAGGTAGGCCCGGCGC 
GCGGCCGCGAGCTCGGCGCCGCGGGCGAGGAGCGACGCCGCGCGGGCGGT 

3 0 CAGGCCGCCGCGCTCGCAGTGCGCCGCGAGCACCAGCGGATCGGCCTCGC 



55 



wo 00/22139 



215 



PCT/US99/23535 



CCGCGGCCTCGAGCCAGTCGGCGGCGAGGCGGTGGCCGAGCGCGCGATCG 
TCCTTGGTGAGCTGCGCGTAAGCGCCCTCGCGCAGGAGCGCCTGGCGGAA 
GGAGTACTCCTCCTCGCCGGGGAAGCGGCCCTCGCGGTGGCGGACGCAGA 
GCTCCCCGGCGACGAGCGCGGAGAGGTGCTCCGCGAGCGGAGCGGCCTCG 
TCGCCCCCGAGCAGGTGCGCGACGGCGCCTCGCCAGAACACCTCGCCGAG 
CACGCTGGCGGCCCGCAGGATCCGGCGCGCGGGGGGCGCGAGCGCCTCCA 
GCCGGACCTGCACCATCGCCACCACCGTCTCGGGCAGCGCGTCGCCGCGG 
CCCTCCGCCGTCGCGCGGATCAGCTCCTCGAGGAAGAACGGCTGGCCCTC 
GGACTGGGTGACCAGACGATCGATGAGGGCCCCGTCGGCCGCGTCGCCCA 
GCGCCTCCCGCGCGAGCTGCGCGCACGCCCTCGGCGGGAGCTGCCTGAGC 
CAGAGCTCCTGCCGCCCGCGCTCGGCCCAGAGATCGGGGTACGCTTGC, 

or their complementary strands, 

(b) DNA-sequences which hybridise under stringent conditions 
tc regions of DNA-sequences according to (a) encoding proteins 
or to fragments of said DNA-sequences, 

(c) DNA-sequences which hybridise to the DNA-sequences accord- 
ing to (a) and (b) because of a degeneration of the genetic 
code , 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isofunctional expression products. 

12. Peptide encoded by a DNA sequence according to claim 12 
selected from the group consisting of 
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Seq ID No 92 

>Concigll_002 591 amino acids MW=63639 D pl=5.80 numambig=0 

MLDVWSTSDQVACRLHCAGAGPSASLELRYDASAGARRDAERLAERLAALLEDLSRHPER 

PVAQGEYVGPGERAEIEAWSRGPAMELPSACALHRWFEERAEQHPDWAVRSEGKSLTYG 

ELERRANRLASCLRRRGVGLDTIVGVCVPRSEDMWATLAVLKVGGAYLPLDHEYPGERL 

AFMMRDARARLLVTHDAIADELPTGGWTTLLLDAEAAEIAACSDARPAVSPPPDSGAYVI 

YTSGSTGTPKGSLISHRAIVNQMQWIQRYWALTADDRVLLKAAFGFDVSVWEIFWPLSFG 

ARIWARAGGHRDPEYLRRLVRDEGATTAYFVSSMLAAFLGGPEQPFPASLRKVLVGGEA 

VPLDLVRRFYAKHDGDLINMYGPSSAAIAVTGCVLPSDPRVTWVPLGAPVANAEVFVLIX3 

AMRRPAIGALGDLYIAGAPLARGy/GQPGLTAERFLPDPCARAAGGRMYRTGDVARFLPD 

GMLEFQGRSDHQI KLRGHR I ELGDVEAQI RRVPGVGOAAWLREDAPGDARLVAYWLDG 

DAAGDAPDVRAGLKASLSAYMIPSSWRLYALPMCSERLAFTGSSYAGCLL* 

Seq ID No 93 

>Contigll_007 361 amino acids MW=38862 D pl=10,42 numambig=0 

MSDHEMTGFSLSPQQRAIRALDREAGAPGCRTLAWAVTGPCDEGRLSAAALALAERHEI 

LRTRLVEAGRARPRRWSASRASRGRQQDDWVGCSEAEQGERMSRLVARLSEDRGADDGLR 

VGLVRVGPEERRLVLAAPAWCVDESSIAPLVRSLCASTAGAGAPPEQQYADVAEWLNGML 

ESEDAGDGRRFWAERRSHFGPPLHLAFSRGGAGAGAGSGRARVDLGQGGMAQVERWSSSW 

QVPQRIVLLALWASLLWRMSGGNEPEVTVAVRFDGRSLDALAGAVGPFARFLPVRIEISA 

SDTLADVARRLALAEAEAAAHQDAAPGVSHRMSWGiiLRRGGRAGAVARRRAGPRARRLEH 
V* 

Seq ID No 94 

>Contigll_012 882 amino acids MW=95015 D pl=rl2.69 numambig=0 
MARALYAQEAAFRSAFNACTU^RARGLDLGEWFREGEGDGRTLLGTALAQPALFAVEL 
ALARLWMSWGIEPAAMIGKSLGELVAACVAGVFTLEDAMSLVIDRGRFMQAAPAGSMLAV 
GLPAAD VEGLLEAGLE I AAVNS PKLTWAGPASAI RDLAARLEAREVFARPLQTSHAFHC 
ALIDGAVAPFLESVRRARLSPPEIPWSNn/TGALLTDAEATDPAYWARHLRQPVRFSDGV 
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EALFASGHALFLEVGPGRC-LTTLVRQTlAGRGGAAIASLGSiKAAS2PASLAEALGQLWE 
AGHAVDWTARPRGRPFARSSACRRTRSSGRGTGSRRAAAPPPPSRRRPTRPSRPRRTPSR 
RRRSPARTPGPRSPRRRPSPSARSPAIWERPPRRPRDRRPRRLLRPARRLARRDPGDVPD 
PRAARRRAPAAGALPGADGRGARGPGRRRAPRRGAAARVPPDRADPPRRPAPAWRAAPA 
WRRSARAGQPGVQHPVRGARDGPARRRRAPPOPVRDRAAPRGAADDVQRQGRRAVPGRR 
ARGPGAVPDVRPRAPRGRGPRRGGLGARPRGEPRAVRPEPRAAPPRAGHPEAPRRARHRA 
RRAPRRVRRLVGRRLRRRARRALRRLRAGPAFPAAGAPGAVRRLRRSPAGVAVRRGPGGR 
APVLDDEALGRAAPGPRAGRPRAGGAKDVARGAAEPRRRGRAHPPDQGVLRAGGDLPLHG 
PARRVQARPAPAHRARGPRRGHRRREPEPRRDRADDRLLRQPARAPDRLRGRPDVRRAGP 
PRARRRARGVRAPGPAVRPARRGPAAEGGRRAPAPLRREVRHAQRARPPDEARGARARGA 
GGRGDHDGVRLRPHGRRGRRLVPLRRRAQQRAVFGGDGRQFP* 

Seq ID No 95 

>Contigll_021 1213 amino acids MW=i31017 D pl=12.40 numambig=0 

MRGRRRRAAPHLRGARPARRGDRRGGLALRSDRERARRRGHGEGLGAGGRRARRAARRRR 

LPAPGPSPPRGAPQAPPGARRGPPRPHAVGGRRDDRLARGHRAPRRRRGRAVARAARRAP 

AARRKHGRPGLRDLHVRVHGAAQGRDDRPPRRGEHGPRHQPPLRRRPGGPGARALVAELR 

PVGLRRVRDARRRRRRRDPRPYPRLGSGAIiARARGARAGDRVELGPGADGDAHGRVPRRG 

RPGAWAPPRHDERRLDPAEAPRSHPRGLPRAPRREPRRRDRGVDLVDRPPDRGRRPGVA 

QKPLRPPAREPAHLRARRGAGAVSDRGPRRDPHRRDRRRARTLARRGADPGAVPEAPHDR 

RAAVQDRRPGPLLRRRDHRAARAHRPSGEDPRLPHRARRDRGRPRAAPLGRAGGRGGEDR 

SVGREAFGRVRRRRRRRWRRAPRLRPEEAAGVHDPRGGRRPPGAAAERERQGGPRRPAGS 

RRGRAARRGRRPADRDRAAHRLGARGGLAGRGRRRDRQPLRARLHLAAPRARAAPARRAH 

RGARARRGRRGAGRVADRPVPVPDHRATGAAARRGDGQGRAGRRRGAARRGAPRRATPPG 

ARIAMSEPIETEDGGSDIAIVGI^GRFPGAPSVDAIiWEbrVRRGVESIARFPESEREEPPV 

GASAAPGAPWCAGGLLDDIDRFDASYFGYSPREAQLMDPQQRLFLECAVAALEDAGCDP 

ARFPGAIGVFGGCGSNTYLLQLLSHPDIJ^TVDPHALMLASEKDYIATRVSYKLJDLHGPS 

WTQTAC3TSLVAVHMACESLLGGQCDLALAGGVSIGIPQKRGYPYVPGSICSPDGRCRP 

FDARAEGTVGGSGVGIVALKRLADALRDRNTVHAVIRGSAVNNDGGRKVGFMAPSVDGQA 

AA I SEAQ3 VAGVDPGS IGYVEAKGTATAIGDPI EVEALTQAFRRKTPRKAYCALGS I KAN 
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IGHLDAAAGVAGLIKAAHWRSGEIPPCVHFEAPMPKLDLAASPFFVPREAAPWPRELRP 

RRAGVS3FGIGGTNAHVVLEEPPPLPPRAPAPERDHVLTLSARTPEALSTACAQLAAHLE 

ATDVPLDDVAFTLQTGRAEHPYRRAWARTRAEAIQGLAREGASALARPDEPRPSSRSRA 

RARRPSGWPARSTRRRRRSGAPSTRARRRRGRAASrSARSSSARARATGARCSAPRWRSP 
RSSPSSSRSPGSG* 

Seq ID No 96 

>Concigll_026 3079 amino acids MW=332984 D pl=:5.97 numambig = 0 

MLTWDHHVWEYWSFALIVRELGELYSALRAGRPPQLPPPSSFFAAGVSCPSPREAAGG 

AEYWRKALDGATTAIDLPRDRARHDAGARRGRAHAITLPKPLTGALARLARERGTTLFSV 

LLSALTVLLHRASGQSDLWGVPSAGRHDDESARAFGYFVQMLPVRVALRGAASFDALVA 

RVRDAFIJDALAHGDSALRKLLARAQGEAQRDALFDVAFAFQSTPPSLDARLSALAIGVGD 

VRIAQGELELTTIADEQAAAEFDLALFAAELDAGIALRFEYDQQLFDPATIERMARHFVV 

LLESAVEHPGRPLSELRMLSDAERALLLDDWSGAAAARQAASAPAPACVHALFEAHAARQ 

PDATALEFGHQRFTYAQLSTWSTELALWLRDRGVGPGSWGVCIERSPRMVAAQLAVLKA 

GAAYASLDPANPPARLAEMI^CRASIJU.TSSQASHKLTAAPCPVHLVQDGACAPSTHIP 

LV3RPDDLAYVLFTSGSTGTPKGVCVRHASLSRLVSFLHLRLDLSPSDRWTQVASSGFDA 

S\nfEIWTPLACGAALLLADDDALRSPTALVSWLVAQRATLSFMPTPLAEACFEQDWTGSA 

LPJIlMTVGGDKLHPLRRPLPFRLFNMYGPTEATVITTVAEIADLGAEPPLGRPVDSALVYV 

LDPHMQPVPPGALGELYIGGACLAOGYTRTDLTAERFLPDPFGOPGARLYRTGDLVRWRP 

dgqlafagrrdeqvklrgrrvelgevesvlrrlpgvregiwlhgqgsaahliahwpda 
hppserdlregmarlvpdalvpahfvllpalpmslsgkvdkkllpappaahadyeppsge 
lelelahiwqsvlhldrvgrhdsffdlgghsllamqvlgriesslgirttlrtlfehptl 
aqlashlssgaastsaaaatalergltrpdgpssprvatpeepfaltegqramwlecqks 
adgalynlgrtvrlgagvdvaalrrafeglverhealrttfltrdghplqqvhrhvalew 
aespamaldereivaradevrrrafdlergpllrvhvwrrgegqppll'iwvhhlwdyw 
sfallvrelgelysalragrppqlpppssffaagvscpspreaaggaeywrkaldgatta 
iciprdrarhdasprrgrahaitlpkpltgaiarlarergttlfsvllsaltvllhrasg 
qn-olwgvpsagrnddestrafgyfvqmlpvrvalrgaasfdalvarvrdafldglahgd 
sa^qhlij^prgaarrggalfdvafafqgalpsijdprlaalttgaedvriaqgelelttl 
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ADEQAAAEFDIALFAAELDSGIALRFEYDQQLrDPATIERMARHFVLLLESAVEHPGRPL 
SELRMLSDAERALLLDDWSGAAAARQAASAPAPACVHALFEAHAARQPDATALEFGHQRF 
TYAELSTWSTELALWLRDRGVGPGSWGVCIERSPRMVAAQLAVLKAGAAYASLDPANPP 
ARLAEMLADCRAALVLTSSQASHKLTAAPCPVHLVQDGACAPSTHIPLVSRPDDLAYVLF 
TSGSSGTPKGVCVRHASLSRLVSFFQHLLALSPRDRWTQLASSGFDASVYEIWTPLACGA 
ALLLADDDALRSPT/U^VSWLVAQRATLSFMPTPLAEACFEQDWTGIALRAMTVGGDKLHP 
LRRPLPFRLFNMYGPTEATVITTVAEVADLGDEPPLGRPIDSALVYVLDPHMQPVPPGVL 

gelyiggaclaqgytrtdltaerflpdpfgqpgarlyrtgdlvrwrpdgqlafagrrdeq' 

VKLRGRRVELGEVESALRRLPAVREGWVLHGQGSAARLIAYWPGADPPSERDLREGMA 

rlvpdalvpahfvllpalpmslsgkvdkxllpappaahadyeppsgelerelahiwqsvl 
hldrvgrhdsffdlgghsllamqvlgriesslgirttlrtlfehptlhqladrlssgaas 
ttaaaatvpaseiapslgrapadepyplsyeqerlwvleqllpggtaynwoavrlrnlv 

DVDALSSAIiAALVRRKWSLRTVFVASPTPAQKICEPEAAPAEVVDLRGTPPDEAEAAARA 
WASREQATGFDLARGPVFRARLFRLDHDVCVLVLSTHHIVTDAWSFQPLVRDLAELYRRA 
RGGGPADMPELPLQYVDFAWQRRKIAGKRLADKLAHWTATLRGLPVLELQTDRPRPPVQ 

tfrgaervlpldarlvaqldeiju^srgatrfn^laalgvllrrssgqddliaigtavanr 
prpeleplvgffvntivmrldlggdptfeellsrarkvaleafehqdapfekweavnpr 
rdlsrsplfqvmlwqnaptealelgevriepldlpveatrfdlrfsveprggrdvislq 
YNVDLFDAAT I drmlatmqsvlsratqdpaqrvrals vapedreralvawndtavatpdh 
lrleepfferaveqpdacawdaerrltygelarraeaiaaaasrsgatanalvawmek 
gweqvaavlgvlragaaylpldprlpeerlrhllehaevrlvltqsavdgtiawpagier 
lavdaderwreqpvarrppggstddlayviytsgstglpkgvmidhrgavntvldinrrf 
dvgpedrvljulsslsfdlsvydvfgtlaaggawipdrtrasdpghwrelverervtvwn 
s vpalmemlmdas pgagdp AL S SLRLVMMSGDW I PLKLPDR I raac rapr wslggatea 

S I WS I AHP I ADVDPAWRS I PYGRPLANQHTYVLDEGLEPCP IGVPGE I H I GGI GVALGYW 

rdeartrerflkhpttgerlyrtgdlgryfadgtiellgrtdhqvkirgfrielgeieaa 
laqhps^/eqawaaktdpsgekrlvaywgadgdgaalrdfvrkklpeymipabvvvlpa 
lplsakgkvdraalpdpaavapraaavaprtaterliasvlaevlqveavgvtdnlfelg 

FTSLLLVRAQRLLAERIAARAPDEGAAAQAVSLTDLFQYPTIEQIoAQRLDAATVKAEPAD 
VGAQRAEARRDARRRRGRG * 
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Seq ID No 97 

>Contigll_011 544 amino acids MW=60164 D pl=9.10 numambig-0 

MMSRIRAQLGVELPLRALFQGPTVAALAAQVDAARRGEARRREFPPIARIPRDGPLPLSF 

AQHRLWFVDQLEPGSPAYNIPFWRATGRLDVDALRRSLFEIARRHEALRTTFSARDGVP 

PPWAPEARVPFRMSDLEHLAGEALDAAVSALVLEESLAPFDLSRGPLLRVRVIRKRHDE 

HVIALVVHHWFDVWSVGVFVGELAALYGGFAQGQPSRLPELPAQYVDFAAAQRAWLSGE 

VLEGELRYWTTKLSGALRRARVPVDHEPAGRRTt^RGARRSLDAGAELTRQIKAFCEREAI 

SPFMALLAAYKLVLHQRTGLEDLWGTDVANRNRVETEPMIGFFVNQLVLRTDCGGDPTF 

GALVRRVRDVALEAFEHQDLPFDRLVEALRPKGAVGHVPLFDAKFVMRNVHVPPMKLEGL 

ELEALEGEATTTAFDFVLTVAEAGGSFRFGVEHSSELYRAATVDNFLSDYRQILATATAR 

PDTPVSELRGELERAAAARRELERKAARGAALDKLTSARRRAVTLPRPGAPGEAKTSPKD 
DLDE* 

Seq ID No 98 

>Contigl2_001 514 amino acids MW=56145 D pl=8.82 numambig=0 

PPAVRRYVADRRPEQLPALAPEEREAAARRLSALiGAAPPQVRRRGLTRAPLSYGQSRIYF 

LEQLSPGKPLFNVPGAVRLRGPVDVARLSAAFGEIVRRHDALRTSIANVDGELLQIAQPH 

AGFAUJVVTSTPEEAAELDRRLRAEAWRPFAIGAPPLLRATLFRIJ^DEHVLLVTMHHVV 

SDDWSLGVILRELLAIiYAGRSLPPPRLQVSDFAAWQREMVESGALDGQRAYWRERLRGLS 

RASISAGGGAEAPSHDPSGAIEEIALSPDKAAALEALARREGATLFMVLLALLDLVIHAR 

SGALDIAVGTPIANRNRPELEDWGLLTNTLVIRVDLARAGAFRDVLARARVQALDAFAN 

QDIPFDWTQDLKQERDHAQHPLFRVWLALQNAPKPALEVRGLRVEPLPLRPELVHFEVA 

LLLWPADDGSWGHFEFRRDRVDEGARKEIAAAFTHLVDAVIARPDAPVSTLVEGARAEA 

ARAQAALGEAFARAATARLGQLRRRS AGDRTPRE * 

Seq ID No 99 

>Contigl2_009 582 amino acids MW=65555 D pl=8.72 numambic=0 

MREPSSTPNWRNFGSNLPAGSDSVPPGEGFPIKKILALNLGKWKDTAGLQIAQALHLFEY 

GYKRYREGKFVLRATSDLGLGAIFESIDNWESFDQFEEFFKPWTFIRKPLVATRWAEDAE 
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FGRQRLVGINPAHIRRATPADLADFV3GAEPKPIAIADGRTLEEVREGGQLYFLDYRIFK 
DIVDTDVOEELGKYPLAPTCMLHQTAAGELLPVAIRLVHSRPGKGAHPDKIFTPSGPSDD 
WLTAKIAVASADAIYQGQVTHLLYAHLIVEPFAVSTYRNLPATHPLHQLLRPHFFNTLAI 
NELARRRFLGRGRFFDITSSVATMGSFELLTRAYTGKGIKGYGGKPWRFYESALPRDLSA 
RDVRDLVGYHYRDDALLHWDAIQEYVGQVLKIAYPTPGSLSSDASLQRWIHELVSPQLGG 
MDSLLPFERADQLEKLTSLDDLIAIVTNIIFTATAYHAAVNFGQTDYYTWIPNAQFATYR 
SYGDVLNGSEKRQFKPLERLPGRAQSIRQMVLSRSLSMGPPLTSESLMTMKCLLQDPAAK 
QAFAR YRERLAH I ERE I TERNRAREQP YLYLLPSMVPQSVAI * 

SEQ ID No 10 0 (>0RF1) 

VSQRTSCYLRGGGVCSMNDAFLALERNERNRPSTVIDLLRQRAEAEPARPIYCFLESGDVEAG 

ATWVTLREIDERARTVAALLQASGVAPGARALLLYPPGIEYITAFFGCLYAGVRTVPAYPPDL 

GRLERTLPRVASIVADARAEAALTSSAVAGIVASLPASAAAAALQRLRWIATDGPSPGPIEGP 

GAALRPESVAFLQYTSGSTGEPKGVMLTKGNLIoHNSRLIAHGFDLTSPDPVGVIWLPPYHDMG 

LIGGILQALYRRIRVALMSPLSrLQRPMRWLRAVSALGASVSGGPNFAYDLCVRKSSEEERAA 

LDLRSWEVAFTGAEPVRADTLDRFARAFAVSGFRREAFYPCYGLAEATLIVSGGARAEAPVLA 

RLAPEEVELGRAVASAAEGARVFVGSGRALDPRAVAIVDPAGNELGPGEIGEIWVSGPSVAVG 

YWGRPEETEATFGATLAGSAAPRYLRTGDLGFLRGGELFWGRSKDLIILRGRNHFPQDIEKT 

VESSHRAVRPGCSAAFSVEHEGEERLAWCEVDPRVAADPREIVAAREAVTAEHQLVAHAVAL 

lAPGALPKTSSGKVRRRECRRAFLEDALGERHVAFAPELLDDASPPDDAPPETEEPSGRSLLD 

ALRSTLARALRLDAGQIDDAIiPISRFGLDSLAAVELQHAFQVRTGRAIPLTSILRGGSLRLTR 

EITRLDGPSSPRVATPGGAVCADRWGTGRFGSSAISRPMERFTTWAGRSGSVPAFKRVDLRRA 
F 

SEQ ID No 101 (>ORF2) 

VYSSAYVLFAVCAGTTRVASAPETAGFPLECVGDDGTVLGPDSFWGYTQVYVFKKERLNTNP 
PIDGFTL.KLDGNEVAPGEDGLPWKRCVRSEEQACGCGRTEPAEDECTTYEIEAWPEKAAEV 
DEEAAGLGGPPAREAIWVDYYTDGGEFDGARRLVSDTTGASRGGNGTTWTPPSEPGRVSLWAV 
VHDTRGGASVTRREVQVE 
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SEQ ID No 102 ;>0RF3) 

WGTVLSAGTGEPLPDIAVTLVRPDGGREEAKTDQGGKFRFKNLPPGKYRVEVAAAGFEPFAA 
E EE T AAGEA T EVR YR I SLAAPQDGKAPGI E VTVQGERPPRE VTRRT T ERRE I DR I PGTGGDAL 
RSLQSLPGVARSGFGLLIVRGSAPQDTLTFVDRTPVPIIYHFGGLSSWPTEMLEKIDFYPGN 
FSAVYGRAMGGIVDVGLRSPKQDGKYHGWQLDLIDGRVLLEGPVPFLKDWTFIAAGRRSWVD 
AWLGPVLKEAGSSVTQAPVYYDYQFVLEGRPSASERVRASFYGSDDAFKITLDKPPEDEPALT 
GDFGLHTAFQRFQLSYENRIGSRDRLLWS^4ALGRDIADFEISFLAFNWSTSLDLRLELSHRF 
ARYLTMNVGTDLSGGVATVNIRAPSQQPAGHPSNQPFSTYPFQDRSFDGAYSRPAAYAET..EW 
PSPRARIVPGVRVDYALDTQTLDVSPRVNARYDIRSGFPRTTAKGGVGLYYQAPQFAESIEPF 
GNAELKSNRAVHYGLGVEQEITPQIEVTLDGFYKQLDRLWFSPEKDDYADGTGYAVGGELLL 
KYKPDERFFGWAAYTLSRSVRKDGPDEEEHLTQFDQTHVLTVLGSLRLGRGWELARFRLVSGN 
LQTPYVCDPEEKGCNPNRVNAIYHASSARYSPIPLGGDYSERMPLFHQLDIRADKTWKFKRWQ 
LGLYLDIQNVYNYMAAEGISYNFNYTKREYVTGLPFLPTLGLRGDF 

SEQ ID No 103 (>0RF4) 

VIAVDNNPEAVDAVKDKTSAAFVGDATVHKVLEGIGAQYVETAIVTFGEHFEPSVLCVASLVR 
MGVRI I ARAATDRQADI LRAVGATRVIQLETEMGRRVGADITMPIAQDLLDLASHYRWPWNA 
HGPLVGQTLAGSKIRQRYRINVLGVRPHTNKRPGDKPRLEAPTPDYVIRDGDTLLLVGDSDDV 
SRFVAEVGG 

SEQ ID No 104 (>0RF5) 

SGSSGGGSSAEGSRCQPSGGGPHWLLEGETVTFPVTCASGLAIiAGDAFEVGPLPEGAAYDPIA 

REVTFSPGLDQAAVYDIEIRVAQTSEVGRVKVGVADAFADPSNVPWDPTRYPEEYGLPVLFL 

SPVPEDKEYAPATVIYRGHTYAAEAELRGESSLSYPKRSYTLKFPKDDKFNEPDEAGGFTDRR 

KWLITTFDDNSYVRQRLAYDLWNRLDFEHIQIKTYSAVLYLDGEYAGLYTVADHVDGYLMED 

HGYPQDGNLYKAVSHDANFALTDRSGDPKDTUIDGFEKKEGAPAEGEPEAFSDLEDLVSFVAE 

SDDATFAAEIGSRIDLRDYEDWWIFVTFIVANDSAGKNSYHYRDPAADGVFRYAPWDFNASFG 

QSWETEREPASDRVDYRDVNLLFERLLEEPSIGDPLRARYDQVLRGALAEAEIHAIVDGYVER 

IDASARRDEARWGEAYRSYEGWSWRDDFTTYEEEIAYLK 

AV7ISERWQHQDELY 
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SEQ ID No 105 (Contig 11 >0R?1) 

VLDVWSTSDQVACRLHCAGAGPSASLELRYDASAGARRDAERLAERLAALLEDLSRHPERFVA 

QGEVGPGERAEIEAWSRGPAMELPSACALHRWFEERAEQHPDWAVRSEGKSLTYGELERRAN 

RIASCLRRRGVGLDTIVGVCVPRSEDMWATIAVLKVGGAYLPLDHEYPGERIAFMMRDARAR 

LLVTHDAIADELPTGGWTTLLIiDAEAAEIAACSDARPAVSPPPDSGAYVIYTSGSTGTPKGSL 

I SHRAI VNQMQWIQRYWALTADDRVLLKAAFGFDVSVWEI FWPLSFGAR T WARAGGHRDPEY 

LRRLVRDEGATTAYFVSSMLAAFLGGPEQPFPASLRKVLVGGEAVPLDLVRRFYAKHDGDLIN 

MYGPSEAAIAVTGCVLPSDPRVTWVPLGAPVANAEVFVLDGAMRRPAIGALGDLYIAGAPLAR 

GYVGQPGLTAERFLPDPCARAAGGRMYRTGDVARFLPDGMLEFQGRSDHQIKLRGHRIELGDV 

EAQIRRVPGVGQAAWLREDAPGDARLVAYWLDGDAAGDAPDVRAGLKASLSAYMIPSSWR 
LYALPMCS ERLAFTGS S YAGCLL 

SEQ ID No 106 (Contig 11 >0RF2 ) 

MSDHEMTGFSLSPQQRAIRALDREAGAPGCRTLAWAVTGPCDEGRLSAAALALAERHEILRT 
RLVEGRARPRRWSASRASRGRQQDDWVGCSEAEQGERMSRLVARLSEDRGADDGLRVGLVRVG 
PEERRLVLAAPAWCVDEESIAPLVRELCASTAGAGAPPEQQYADVAEWLNGMLESEDAGDGRR 
FWAERRSHFGPPLHLAFSRGGAGAGAGSGRARVDLQGGMAQVERWSSSWQVPQRIVLLALWAS 
LLWRMS GGNEPEVTVAVRFDGRSLDAIiAGAVGPFARFLPVRI E I SASDTIJUDVARRIAIJ^^ 
AAAHODAAPGVSHRMSWGLLRRGGRAGAVARRRAGPRARRLEHV 

SEQ ID No 107 (Contig 11 >0RF3) 

MSRIRAQLGVELPLRALFQGPTVAALAAQVDAARRGEARRREFPPIARIPRDGPLPLSFAQHR 
LWFVDQLEPGSPAYNIPFWRATGRLDVDALRRSLFEIARRHEALRTTFSARDGVPFPWAPE 
ARVPFRMSDLEHLAGEALDAAVSALVLEESLAPFDLSRGPLLRVRVIRKRHDEHVIALWHHV 
VFDWJSVGVFVGELAALYGGFAQGQPSRLPELPAQYVDFAAAQRAWLSGEVLEGELRYWTTKL 
SGALRRARVPVDHEPAGRRTWRGARRSLDAGAELTRQIKAFCEREAISPFMALLAAYKLVLHQ 
RTGLEDLWGTDVANRNRVETEPMIGFFWQLVLRTDCGGDPTFGALVRRVRDVALEAFEHQD 
LPFDRLVEALRPKGAVGHVPLFDAKFVMRNVHVPPMKLEGLELEALEGEATTTAFDFVLTVAE 



wo O0/22I39 



224 



PCT/US99/23535 



AGGSFRFGVEHSSELYRAATVDNFLSDYRQILATATARPDTPVSELRGELERAAAARRELERK 
AARGAALDKLTSARRRAVTLPRPGAPGEAKTSPKDDLDE 

SEQ ID Xo 108 (Contig 11 >0RF5) 

M3EPIETEDGGSDIAIVGMAGRFPGAPSVDALWENVRRGVESIARFPESEREEPPVGASAAPG 
APWCAGGLLUDI DRFDAS YFGYS PREAQLMDPQQRLFLECAVAALEDAGCDPARF PGAI GVF 
GGCGSOTYLLQLLSHPDI^TVDPHALMIASEKDYIATRVSYKLDLHGPSVWQTACSTSL^^ 
VHMACESLLGGQCDLAliAGGVS I G I PQKRG YP YVPGS I OS PDGRCRPFDARAEGTVGGSG VG I 
VALKRLADALRDRNTVHAVIRGSA^/NNDGGRKVGFMAPSVIXK^AAAISEAQSVAGVDPGSIGY 
VEAHGTATAIGDPIEVEALTQAFRRKTPRKAYCALGSIKANIGHLDAAAGVAGLIKAAHWRS 
GEI PPC\r-iFEAPNPKLDLAAS PFFVPREAAPWPRELRPRRAGVSS FGI GGTNAHVVLEEPPPL 
PPRAPAPERDHVLTLSARTPEALSTACAQLAAHLEATDVPLDDVAFTLQTGRAEHPYRRAWA 
RTRAEAIQGLAREGASALARPDEPRPSSRSRARARRPSGWPARSTRRRRSGAPSTRARRRRGR 
AAS I SARS SSARARATGARCSAPRWRS ?RS SPSSSRSPGSG 

SEQ ID No 109 (Contig 11 >0RF6) 

WDHHVWEYWSFALIVRELGELYSALRAGRPPQLPPPSSFFAAGVSCPSPREAAGGAEYWRK 
ALDGTTAIDLPRDRARHDAGARRGRAHAITLPKPLTGALARLARERGTTLFSVLLSALTVLLH 
RASGQSDLWGVPSAGRHDDESARAFGYFVQMLPVRVALRGAASFDALVARVRDAFLDALAHG 
DSALRHLUUIAQGEAQRDALFDVAFAFQSTPPSLDARSALAIGVGDVRIAQGELELTTLADEQ 
AAAEFDLALFAAELDAGIALRFEYDQQLFDPATIERMARHFWLLESAVEHPGRPLSELRMLS 
DAERALLLDDWSGAAAARQAASAPAPACVHALFEAHAARQPDATALEFGHQRFTYAQLSTWST 
SIJU^W^JlDRGVGPGSWGVCIERSPR^fVAAQIAVLKAGAAYASLD?ANPPARLAEMIA^ 

laltssqashkltaapcpvhlvqdgacapsthiplvsrpddlayvlftsgstgtpkgvcvrha 
slsrlvsflhlrldlspsdrwtqvassgfdasvyeiwtplacgaallladddalrsptalvsw 
lvaqratlsfmptplaeacfeqdv;tgsalramtvggdklhplrrp?frlfnmygpteatvitt 
vaeiadlgaepplgrpvdsalvyvldphmqpvppgalgelyiggaclaqgytrtdltaerflp 

DPFGQPGARLYRTGDLVRWRPDGQLAFAGRRDEQ VKLRGRR VELGEVES^/LRRLPGVREG I w 
UiGQGSAAHLIAHWPDAHPPSE?J)LREGMARLVPDALV?AHFVLLPALPMSLSGKVDKKLLP 
APPAAHADYEPPSGELELELAHIVrQSVLHLDRVGRHDSFrDLGGHSLLAMQVLGRIESSLGIR 
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TTLRTLFEHPTLAQLASHLSSGAASTSAAAATALERGLTRPDGPSSPRVATPEEPFALTEGQR 
AMWLECQKSADG 

A1YNLGRTVRLGAGVDVAALRRAFEGLVERHEALRTTFLTRDGHPI.QQVHRHVM.EWAEEPAM 

AIDEREIVARADEVRRRAFDLERGPLLRVHVWRRGEGQPPLLTVWHHLWDYWSFALLVREL 

GELYSALRAGRPPQLPPPSSFFAAGV3CPSPREAAGGAEYWRKALDGATTAIDLPRDRARHDA 

SPRRGRAHAITLPKPLTGAIJUiIARERGTTLFSVIJ.SALWLIJmSGQNDLWGVPS 

DESTRAFGYFVQMLPVRVALRGAASFDALVARVRDAFLDGLAHGDSALQHLLAEPRGAARRGG 

ALFDVAFAFQGALPSLDPRLAALTTGAEDVRIAQGELELTTLADEQAAAEFDLALFAAELDSG 

lALRFEYDQQLFDPATIERMARHFVLLLESAVEHPGRPLSELRMLSDAERALLLDDWSGAAAA 

RQAASAPAPACVKALFEAHAARQPDATALEFGHQRFTYAELSTWSTELALWLRDRGVGPGSW 

G VC I ERS PRMVAAQLAVLKAGAAYAS LDPANPPARLAEMLADCRAALVLTS SQASHKLTAAPC 

P^/HL VQDGACAPS TH I ? L VS RPDDLA YVLFT SGS SGTPKGVCVRHAS LS RL VS F FQH LLALS P 

RDRWTQLAS SGFDAS WEI WTPIACGAALLIiADDDALRS PTALVSWLVAQRATLSFMPTPLAE 

ACFEQDWTGIALRAMTVGGDKLHPLRRPLPFRLFNMYGPTEATVITTVAEVADLGDEPPLGRP 

IDSALVYVIJ3PHMQPVPPGVLGHLYIGGACLAQGYTRTDLTAERFLPDPFGQPGARLYRTGDL 

VRWRPDGQIAFAGRRDEQVKLRGRRVEIXSEVESALRRLPAVREGVVVIiHGQGSAARLIAYVVP 

GADPPSERDLREGMARLVPDALVPAHFVLLPALPMSLSGKVDKKLLPAPPAAHADYEPPSGEL 

EREIiAHIWQSVLHLDRVGRHDSFFDLGGHSLIJ^QVLGRIESSLGIRTTLRTLFEHPTLHQLA 

DRLSSGAASTTAAAATVPASEIAPSLGRAPAD 

EPYPLSYEQERLWLEOLLPGGTAYNWQAVRLRNLVDVDALSSAIJU^LVRRHWSLRTVFVAS 
PTPQKICEPEAAPAEWDLRGTPPDEAEAAARAWASREQATGFDLARGPVFRARLFRLDHDVC 
V1.VLSTHHIVTDAWSFQPLVRDLAELYRRARGGGFADMPELPLQYVDFAVWQRRHLAGKRLAD 
KJ^WTATLRGLPVLELQTDRPRPPVQTFRGAERVLPLDARLVAQLDELARSRGATRFMVLLA 
ALGVLLRRSSGQDDLAIGTAVANRPRPELEPLVGFFVNTIVMRLDLGGDPTFEELLSRARKVA 
LEAFEHQDAPFEKWEAVNPRRDLSRSPLFQVMLWQNAPTEALELGEVRIEPLDLPVEATRF 
CLRFSVEPRGGRDVISLQYNVDLFDAATIDRMLATMQSVLSRATQDPAQRVRALSVAPEDRER 
ALVAWNDTAVATPDHLRL5EPFFERAVEQPDACAWDAERRLTYGELARRAEAIAAAASRSGA 
TANALVAWMEKGWEQVAAVLGVLRAGAAYLPLDPRLPEERLRHLLEHAEVRLVLTQSAVDGT 
I AWPAG I ERLAVDADERWREQ PVARRPPGGSTDDLAY VI YTSGSTGLPKGVM I DHRGA VNTVL 
DINRRFDVGPEDRVLALSSLSFDLSVYDVFGTLAAGGAWIPDRTRASDPGHWRELVERERVT 
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WNSVPALMEMLf^ASPGAGDPALSSLRLVMMSGDWIPLKLPDRIRAACRAPRVVSLGGATEA 

SIWSIAHPIADVDPAWRSIPYGRPLANQHTYVLDEGLEPCPIGVPGEIHIGGIGVALGYWRDE 

ARTRERFLKHPTTGERLYRTGDLGRYFADGTISLLGRTDHQVKIRGFRIELGEIEAALAQHPS 

VEQAVVAAKTDPSGEKRLVAYVVGADGDGAALRDFVRKKLPEYMIPAEWVLPALPLSANGJC/ 

DRAALPDPAAVAPRAAAVAPRTATERLIASVLAEVLQVEAVGVTDNLFELGFTSLLLVRAQRL 

LAERIAARAPDEGAAAQAVSLTDLFQYPTIEQLAQRLDAATVKAEPADVGAQRAEARRDARRR 
RGRG 

SEQ ID No 110 (Contig 12 >0RF1) 

PPAVRRYVADRRPEQLPALAPEHREAAARRLSALGAAPPQVRRRGLTRAPLSYGQSRIYFLEQ 
LSPGKPLFNVPGAVRLRGPVDVARLSAAFGEIVRRHDALRTSIANVDGELLQIAQPHAGFALD 
VVTSTPEEAAELDRRLRAEAWRPFAIGAPPLLRATLFRiAEDEHVLLVTMHHWSDDWSLGVI 
LRELLALYAGRSLPPPRLQVSDFAAWQREMVESGALDGQRAYWRERLRGLSRAS I SAGGGAEA 
PSHD PSGAI EE I ALS PDKAAALEALARREGATLFMVLLALLDL V I HARSGALD I AVGTP I ANR 
NRPELEDWGLLTNTLVIRVDLARAGAFRDVLARARVQALDAFANQDIPFDVVTQDLKQERDH 
AQHPLPRVWLALQNAPKPALEVRGLRVEPLPLRPELVHFEVALLLWPADDGSWGHFEFRRDR 
VDEGARKEIAAAFTHLVDAVIARPDAPVSTLVEGARA5AARAQAALGEAFARAATARLGQLRR 
RSAGDRTPRE 

SEQ ID No 111 (Contig 12 >0RF2 ) 

MSIHIEEEGRADAAKAPPFDYLQALHSALAHENDPVKRKQIEAGMVFKWLREEFLPFLSQLRR 

EKPI FS I PAITLVTRYNDVVEVLNANDVFSVDNI APKLVE WGQNI LAMmD 

I^FPRADLPRYRQIVVDEANRLiAKVGVDKPFDLTGDYALRVPAGAMARYIiGVGEIPTEKVVA 

WTHALFNEIFLNPTNDPTAVAAARAARQEALPMIDAIVAARiCKQLAKSPPPEQFSVLDRYLVM 

QSVPETYESDEGIRDVILGLLMGCVDLSGGAIVNALVEI^KRPR\n^RDALNVVNVEDD;^ 

YVLEALRFRPPSTGVTSLCVRDYTVGRGTRHEEKVPAGALVMACSASAMHDHEHIDAPDQFRP 

GRLPSRNYLFWESGIHTCHGKYVAILHISLAIKQLLRAGVPSAIDPMPRVHGYPAPFRVRLAA 

AEG 
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SEQ ID No 112 (Contig 12 >0RF3) 

MREPSSTPNWRNFGSNLPAGSDSVPPGEGFPIKKIIJUiNLGKWKDTAGLQIAQALHLFEYGYK 

RYREGKFVLRATSDLGLGAIFESIDNWESFDQFEEFFKPWTFIRKPLVATRWAEDAEFGRQRL 

VGINPAHIRRATPADLADFVSGAEPKPIAIADGRTLEEVREGGQLYFLDYRIFKDIVDTDVQE 

ELGKYPLAPTCMLHQTAAGELLPVAIRLVHSRPGKGAHPDKIFTPSGPSDDWLTAKIAVASAD 

AIYQGQVTHLLYAHLIVEPFAVSTYRNLFATHPLHQLLRPHFFNTLAINELARRRFLGRGRFF 

DITSSVATMGSFELLTRAYTGKGIKGYGGKPWRFYESALPRDLSARDVRDLVGYHYRDDALLH 

WDAIQEYVGQVLKIAYPTPGSLSSDASLQRWIHELVSPQLGGMDSLLPPERADCLEKLTSLDD 

LIAIVTNI IFTATAYHAAVNFGQTDYYTWIPNAQFATYRSYGDVLNGSEKRQFKPLERLPGRA 

QSIRQMVLSRSLSMGPPLTSESLMTMKCLLQDPAAKQAFARYRERLAHIEREITERNRAREQP 
YLYLLPSMVPQSVAI 

SEQ ID No 113 (Contig 12 >0RF4 ) 

VSSSRSTGRVPRDRASPAGSCAPALVPGPPLSYASVMPPLDIiHVALFGASGAGKTVLLAAFYR 
AQTQPSFQQEYAYKIQAVNKAQGNQLLGRFYRLESGRFPDGSTRFDEYEFDFFPRDLPEPAVR 
IHWYDYPGRWWEDEPVDADEREAMRQGLIRLGMSQVGILLADGAKYRAEGTGYIRWLFEHFAD 
ECDRLRRASAATGDEVSFPREWIIxALSKADLCPPDYSARDFEREVCRDADDQLAKLCSVLRAE 
HAFGHRFMLLSSVAAPAGAQVDPRTSLGVRTLAPAILVSTVEGAVREAQAARKEKSAGETFFQ 
GLRDLVQFVDSLDDFLPKRYQIVSKILRFISIKDFATTRLDRLKKMREDAIRKGDTFTAVLTA 
MVAALRDDEGARAYHQNQ 

SEQ ID No 114 (Contig 12 >ORF5) 

MPAPAPLVETSRLLWRTRGEHWDYEFICVPEIPALPAWLSTLEAMLADADAGAGELRYGLLEI 
DDRGQRAPRAYPYVAVRFLDPARRDWTGRQVQHFAAWFPPVPPHAVAELPEAVPADWHLRVLD 
GLAG7YGSGEVFGLPEATIRAWKRSHDESRAARAMAIVKATPPVSLGGGEAAPSRWTRVPTLK 
KKPPEPPAAAGLLSVGAVPSGQGRRFGCFAIGAMMLAAFCRLMLACGVRLLGA 

SEQ ID No 115 (Contig 12 >0RF6) 

VRFRSSLGPLLLAALGAALTVSAAFRSAEASVFDSASRWPEDADGHVRIPVCIDPTSSAEQRV 
DGAAGGLIHAPNPSLADVITRVRTALQGSWERWSSVRFTGWESCDSLLPATRMTYVGVRIHPD 
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APNQSDSTGVYNKGGSVQFKPWGADFNRCIKYNWQTARVHIYSFDCVEQYAIHEMGHAIGFMHE 
VTHHPLVPSACSQREPLPASDVASGWPSSRRYIWNPGFYDYDSIMTYWSGCSDQDGVRFGSET 
LDAVDIQAVATVYPPVGGAPDVCNPGWFAGKRWFCAAQPTVSVGNSCSSGWVECLFHCNPRPF 
QGEWWTCPTNPYAVTGQSCSARWELCGD 

SEO ID No 116 (Concig 12 >0RF7) 

VGESQGALVGG^lALSTNALNLNAL^^LNAIJ^I^ALNLSGLSAR^«^ 

AASCALSSTASFDFSWTDSNGKRHDERYPGLLGVAPAWASGPLDDAGQRIVSSCVAARVNYYQ 
VPVLLSARSLRDPLKTLSSSQELIDYPDVEGAFWGNLFAAQPYINACYNSATVDNSRAYQRDC 
AAGHVTSGGQIVECGLIRIAGSCDRVCQKLNGAGQYYPSCVDRPGQSTATTKDVITTALP 

SEQ ID No 117 (Contig 12 >0RF8) 

VIJ^CERGGLTARAASLI^RGAHIJWVRRAYLDAEGCYGRVEALLGALLPEERRARGIJUiFR 
LGRHTEALADLAAAREAAAAASEAGAEIELLLDEAMILDWTGEYRAARERVAAAERIAGRVAS 
PLLGARLLLGVGRSLHRADREDEAAAVLTRAAAQAARLGDEGHETHIIALLLLGFILASLGRV 
ESAARDLDAVILSCEERSDLMHLGAALNNRGLARALQGDRAGMIADFERTIALGRELGQPAFE 
LVGRYNIiAEYLYLMDDIAAARPHARAVQAIAPRCGDRHAPVVVTLLIARLRLYQGDEAGARRI 
ALRLRAARDDAGCEALKPSEDVLCAMIELATRDDDRAAWAALEERSARCSVGQERIEVLEARA 

LJ^RRGRRADARAQLERAIJ^STI PTVMGGRLRRWYAELTRATESDAPDI DLAAAEATFTC 
ARAREKVEY 

SEQ ID No 118 (Contig 12 >0RF9) 

QAYPDLWAERGRQELWLRQLPPRACAQLAREALGDAADGALIDRLVTQSEGQPFFLEELIRAT 

AEGRGDALPETWAIWQVRLEAl^PPARRILRAASVLGEVFWRGAVAHLl^GDEAAPLAEH 

ALVAGELCVRHREGRFPGEEEYSFRQALLREGAYAQLTKDDRALGHRLAADWLEAAGEADPLV 

LAAHCERGGLTARAASLLARGAELAAARRAYLDAEGCYGRVEALLGALLPEERRARGLARFRL 

GRHTEAL^LAAAREAAAAASEAGAEIELLLDEAMILDWTGEYRAARERVAAAERLAGRVASP 

LLGARLLLGVGRSLHRADREDEAAAVl.TRAAAQAARIiGDEGHETHIIALLLLGFILASLGHVE 

EAARDIX)AVILSCEERSDLMHLGAAJJ^GLARALQGDRAGMIADFERTIALGRELGQPAFEL 

VGRYNIAEYLYLMDDLAAARPHARAVQAIAPRCGDRHAPVVVTLLIARLRLYQGDEAGARRIA 
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LRLRAARDDAGCEALKPSEDVLCAMIELATRDDDRAAWAALEERSARCSVGQERIEVLEARAL 

aalrrgrradaraqleraljuu^stiptvmggrlrrwyaeltratesdapdidiaa;^ 
rarekvey 

14. UNA sequence according to any of claims 1 to 5 wherein the 
DNA is selected from the group consisting of 

(a) the following DNA sequences: 

Seq ID No 119 (>Contigl7) 

TTACGTTACTCATCCTATCTCGGCACCCTGTGTCGGTGATGTCGCTCGCC 
TCGAGCGCGAGCGGGACGACGTCGGCGCCGCGCTCGGTGAGCGCCGCCGC 
GAC-GGCGCTCGCGAGATCGCTGGCGACGCCGGCCGGGGCCACGACGAGCC 
ACGTCCCCGCGACGTCGCCGCGTGACGCGGCGCTCACGGGTCTCCATTCG 
ACGCGGTAGCGCCACGCGCCCACGGTGCTCTGCTCTCGGCGGCTCCGCCG 
CCACGCCGACAGGGCCGGCATGAGGCTCTCGAGGGCCGAGCGCCGCCCGC 
TGTCGGCGACGTGGAGCGCGTCCGAGAGCGCCGCGACGTCGCCGCGCTCG 
ATGGCTCGCCAGAACGCGGTCTCCTCGGCGGACGCTCCCGGCGCCGCGTC 
CTCATCGTCCGACGCGTCGCCTGCGTCGAGCCAGAACCGCTCGCGCTGGA 
ACGCGTACGTCGGCAACGTCACGCGGCGCGCCCCGAGCGGAGCGAAGAAC 
GCACCCCAGTCGATGGCGTGCCCGCGCGCGTGGAGCTCGCCTGCCGAGAG 
GAGGAAGCGCTCGAGGTCGCCTTCGTCGCGGCGGAGCGAGGACACCACGG 
TCGCATCGCCGTCGATCGACGAGAGCGTCTCGTCGAGCGCGACGGTGAGC 
ACGGGGTGAGGGCTGACCTCGACGAAGAAGCGGTGGCCGTCGTCGAGCAG 
GGCGCGCGTGGCGTGCTCGAAGCGGACGGTGTGGCGCAGGTTTCGGTACC 
AGTGGGCGGCGCCGAGGGCCTCGCCATCAAGCCTCTCGCCCGTCACCGCG 
GA3TAGAGCGGCACGGTCGCCGGGCGCGGCGCGATGCCGTCGAGCGCCTC 
CAGCATCGTCCGCTCGATGGCCTCCACGTGGGCGGAGTGGGAGGCGTACT 
CGACGCGGACCTTGCGGGCGAACAGCTGCGCCCCGCTCAGCTCTGCGACG 
AGCTCGTCGATAGCGCCGGGGTCTCCGGAGACGAGGGCCGCGTGAGGGCT 
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GTTGATCGCCGCTATCGCCAGGCGTTCGCCCAAGGGCGCAAGGCGCGCCT 

10 CGAGCTCGGCGGTGGTGAGCTCGACGGCGGACATGGCGCCGCGTCCCGCG 
AGCTTCGTAATGGCGCGCGAGCGGAGCGCGACGACCCTGGCGGCGTCTTC 
TAGCGAGAGCGCGCCCGCGACGTACGCGGCCGCGA7CTCGCCCTGGCTGT 
5 GGCCGACGACCGCGTCGGGCGTGACTCCGGCGGCGCGCCAGGTGGCGGCG 
AGGGCGATCATGACGGCGAACAGCACGGGCTGCACCACGTCGACGCGCTC 
GAGCATGGGCGCGGCGTGCGCTTCGTCGCCGCCGAGCACGGCGAGGAGCG 
ACCAGTCGACGTGCGGCGCCAGGGCGCGCTCGCACGCCTCGATCTCGGCC 

2^ CGAAAGGCGGGCGAGGAGGCGAGCAGAGCGCGCGCCATCGATGGCCACTG 
10 CGAGCCCTGGCCGGGGAAGACGAAGGCGACCTTGCCCGGCGGGAGCGCCT 
CGCCCGCGACCGTTCCTGCCCCCGCGCGCCCCTCGGCGAGCGCCGCGAGC 
GCCGAGAGCAGCGCGGCGCGATCGTCTGCCACGACGGCGGCGCGACGCTC 

25 GAAATGCGACCGCGTGGTCGCGAGCGACGCCGCGACGTCGACGAGGGCGA 
CGTCCTCGTGCTCGGCGAGGTGCGCGTGGAGCTTGCCCGCCTGAGCGCGG 
i 5 AGCGCCGCGTCGCTCTTCGCCGAGAGGAGCACCGGCACCGGCGGCGCGAA 
GGGCGCGCGGGCGGGCTCCCCGGCCTGGTCGTCGCCGGCCGCCGCGCGCG 
GCGCTTCCTCGAGGACCACGTGCGCGTTGGTGCCGGAGATCCCGAACGAC 
GACACCGCCGCGCGCCGAGGAGACCCGCCTGGCTTCCACGGTACCTCCTC 
GGTCAAGAGGCGGATCGCGCCGGACGACCAATCGATGTGCTGCGACGGGC 

25 2 0 TCGCGGCGTGGAGCGTCCTCGGGAGGACGCCGCTCTGCAGCGCGAGCACC 

ATCTTGATGACGCCGCCGATCCCCGCGGCGGCCTGCGTGTGCCCGAGGTT 
CGACTTTAGGCTCCCGAGCCACAGCGGGCGCTCCTTCGCGTGCGCCGCGC 
CGTACGTCGCGAAGAGCGCGCGCGCCTCGATGGGATCGCCGAGCGTCGTG 

"^0 CCGGTTCCGTGCGCCTCGACGGCGTCGACGTCCGCGGGGGCGAGCCCCGC 
2S GCTCGCGAGCGCGTCCCGGATCACGCGCTCTTGCGCGGGGCCGTTCGGCG 
CCGTGAGCCCTTGGCTCTTGCCGTCCTGGTTGACGGCCGATCCGCGCACG 
ATCGCGAGCACGGGGTGCCCGTTCTTCCGGGCGTCCGACAGGCGCTCGAG 

45 

GAGCACTATCCCAGCGCCTTCCGACCAGCCCGCGCCGTTCGCGTGCGACG 
AGAACGACTTGCACCGCCCGTCCGGCGCGCCCGCGTGCTGCGCGCTGAAC 
3 D TCGCCGAAQATCCCGGGGGTCGCCATCACGGTCACGCCGCCGGCGAGCGC 

50 
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15 



20 



30 



35 



45 



15 



GAGCGAGCACTCGCCTCGACGGATGGCGTGGCAGGCGAGGTGGAGCGCGA 
10 CGAGCGACGAGCTGCACGCCGTGTCGACGCT 

Seq ID No 120 (>Contigl8) 
5 TTTTAGGANCCCCGACGTGCACGATCGGCTCGCCAACCTCGTGGCGCGCC 
GGGACTATTTTTACCAGCTCGCGTTGCGCGCCGCGGGGACCTACGTGCGG 
GGCCTCGTCCGCGCCCCGCACGACGGCGCGCGCCCCCCCGCGTTCGCGCC 
GCGTGGGGCGGCGCTCGTCACGGGCGGGACCGGGGCGCTCGGGGCGCACG 
TTGCCCGTTGGTTCGCGCGGATCGGCGCCGAGCACATCGTGCTCGCGAGC 
10 CGCCGCGGAGCCGCGGCCCCCGGCGCGGCCGCGCTCGCCGAGGAGCTTTC 
GGTGCTCGGCGCGCGCGTGACGCTGGTTGCGTGCGACGTCCCCa^TCGTG 
AGGCGGTCGCGGGGCTCGTGCGCAACGTCAAGGCCGGCGGAGCGACGGTG 
25 CGCGCCGTGTTCCACGCGGGCGGTGCGATGCACGAGGCGCCGGTCGCCGC 
CATGCGTGTTGAGGAGCTCGCCGACGCGATCGCCGTGAAGGCCCGCGGCG 
CGCAGCACCTCCAAGACGTCTTCGCGCAGCGCCCGCTCAACGCGTTTGTC 
CTCTTCTCGTCAGAAACGGGTGTGTGGGGCGGTGGCCGGCAAGGCGCGTA 
CGCCGCGGCGAACGCGTTCCTCGACGCGCTCGCCGAGGCGCGTCGCGCGG 
ACGGCCTCGCGGCGACCTCGATCGCGTGGGGCGCGTGGGCGGGCGGCGGA 
ATGCTCGCGACCGACGCCGAGCGGCGCTTGAAGCATCGCGGCGTCGCGCC 
GATGGATCCGGAGCTCGCCGTCGCGGCCCTCGCGCACGCGCTCGATCACG 
CCGAGACGTGCCTCGCCGTCGCTGACGTCGACTGGGCGCGCTTCGCCCCG 
TCGTTCGCCTCGGCGCGTCCTCGCCCGCTCCTCGACGAGCTCGCGGAGGC 
GCGATCGGCGCTCGACGCGCTGCGCGAGCCACCGGACGACGCGCGCACGG 
40 CCGCCGGTCCCGAGCCCGCAAGCACGCTGAGGACCACGCTCGCGGCGCTC 
2 5 CCGGAGGGCGAGCGCCACCGCCACCTCCTCGCGCTCGTGCGGACGGAGAC 
•GGCGGCGGTGCTCGGGCACGCGGACGCGTCGCGCGTCGAGCCGAACCGCG 
GGTTCTTTGACCTCGGGCTCGACTCGCTCATGTCCGTCGAGCTCCGCAGG 
CGCGTCCAGCGCGCGACCGGCATCAAGCTCCCGGCGACGCTCGCGTTCGA 
CCACCCGACGCCGAGCGCGCTCGCGAGCAAGGTGCTCGCCGCGATCGTCC 
30 TCCACGACGCGACCCCGCGCGCCTCGCCCGCCGCGGAGCTCGAGCGCCTC 
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GAGGGGATGCTCTCGGCGATCTACGCGGACGAAGCGCTCCGCGACGACCT 
10 ^ACGGCGCGCCTCCGCGCCTTCCTGGACAAGCGCGCGGTCCGCACCGAAC 
GCCCCGACGACGCCGCGTTCGCCGAGAAGCTCGGCTCCGCGAGCGCCGAC 
GAACTCATTCGCCTGATCGATCAGAAGCTCGGAGATCGCATCGATGTCGA 
5 CCGTTACTAACGACACGCTCACGGAGTACTTGCGGCGCCTCACTCAAGAG 
^5 CTCCACAGGAGCGAGACGCGCCTGCGTGCGACGGAAGAGAGGCGACATGA 
GCCGATCGCCATCGTCGGCCTCGGGCTCCCCTTCCGGGGCGGGATCCACG 
ACCGCGACACGCTCTGGACGTTCCTCGAGGAGGGCCGCGACGCCATCGCG 
CCGATCCTCGCGAGCCGC7GGAACGCGGACGCGACGTACGACCTCGATCC 
10 GGACGCCGTCGGCAAGAGCTACGTGCGCGACGCCGCCATGCTCGATCGCG 
TCGACCTTTTCGACGCCGATTTCT7CGGGATCAGCCCGCGCGAGGCGAAG 
TACGTCGACCCGCAGCACCGCCTCTTGCTCGAGACGTCGTGGCAAGCGCT 
CGAGGACGCGGGGATTGTGCCGGCGTCGCTGCGAGACTCGAAGACCGGCG 
TCTTCGTCGGCACGGGCGCGAGCGACTACGCGTTCCTCCAGAGCGATCGC 
1 5 GACGCCTCGGAGGCGTACGCGTTCATGGGGATGATCTCGTCGTTCGCGGC 
GGGCCGCCTCGCGTTCACGCTCGGGCTCCAAGGCCCCGCGCTATCGATCG 
ACACGGCGTGCTCTTCGTCGCTCGTCGCGCTCCACCTCGCGTGCCAGTCG 
CTGCGTCAAGGCGAGTGCGACCTCGCGCTCGTCGCGGGTGTGCAGGTCAT 
GTCGTCGCCGGAGGTGTTCGTGCTGCTCTCGCGCACGCGCGCGCTCGCGA 
2 0 GCGACGGGCGATCGAAGACGTTCTCGGCGAACGCCGACGGCTATGGCCGC 
GGCGAAGGCGTCGTCGTCCTGGCCGTCGAGCGCCTCCGCGACGCGCGCGC 
GAAAGGGCGCCCGATCCTCGCGGTGATCCGCGGCAGCGCGGTGAACCACG 
ACGGCACGTCGAGCGGGATCACGGTCCCGAACGGGCCCGCGCAGCAGAAG 
40 GTGCTCCGCGCCGCGCTCGACGACGCGCGGCTTGTCCCCGCCGACGTCGA 

2 5 CGTCGTCGAGTGCCACGGCACGGGGACCTCCATCGGCGATCCCATCGAAG 

TGAACGCGCTCGCCGCCGTCTACGGCGAGGGGCGCCCCAAGGACCGCCCG 
CTGTTCCTGGGCGCGCTGAAGACCAACATCGGGCACCTCGAGTTCGCGTC 
GGGCCTCGCCGGCGTCGCGAAGATGGTCGCCTCCATGCGCCACGCGACCC 
TCCCCGCGACGCTGCACACGAGCCCGCTCAACCCGCTCGTCGACTGGGAC 

3 0 GCGCTCCCQGTGCGCGTCGTCGACC-CCGCGCGCCCGTGGACGCGCCGCGA 
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CGACGGCGCCCCCCGGCGCGCCGGCGTCACGGCGATCGTCGAGGAGGCGC 

CCGCCGAGCCCGAGCCCACGACGCCCGACGCCGCGCCCGCGCTTCCGGCC 

GTGCCCGTTCTCCTCTCGGGCAAGACCGACGAGGCGCTGCGCGCGCAGGC 

AGCGCGCCTCCACGCGCACCTCGCGGGGCGCCCCGACGCGCGGCTCGTCG 

ACATCGCCGCGTCGCTCGCGACGACGCGCACGCACTTCGATCGACGCGCG 

GCCGTCGTCGCGGCGGATCGCGACGAGCTCCTCGGCGCGCTCGACGCGCT 

CGCGCGCGGCGAGGCAGGCCCGGGGTCGGTCGTCGCGAGCGCGATCCCCG 

CCGGCAGGGTCGTGTTCGTGTTCCCCGGCCAAGGCTCGCAGTGGGTCGGG 

ATGGCGCGCGCGCTCCTCGCGTCGTCGGTGGTCTTCCGCGACGAGATCGC 

GGCCTGCGAGCGCGCGCTCGCGCCGCACGTCGCCTGGTCGCTCGGCGCCG 

TTCTCCGGGGCGACGGCGACGAGGCGACGCTCCTCGGCCGCGTCGACGTC 

GTGCAGCCGGTCCTCTTCGCCGTCATGGTCGCCCTCGCCGCGCTCTGGCG 

CTCGATCGGCGTCACGCCCGACGCCGTCGTCGGGCACAGCCAAGGCGAGA 

TCGCCGCCGCCTACGTCGCCGGCGCCCTCTCGCTCGAAGACGCCGCCAAG 

GTCGTCGCGCTGCGCGCACGAGCGCTCACGAAGATCGCGGGGCGCGGGGC 

GATGGCCGCCGTCGAGCTCGGCGCACGCGACACCGAGGCGCGCCTCGCGC 

CGTTCGGCGACGCCATCGCGATCGCGGCGATCAACAGCCCGCGCGCCACG 

CTCGTCGCGGGCGACACGGACGCGATCGACGCGCTCGTCCGCGACCTCGA 

GGCCGCGCAGATCTTCGCGCGGAAGGTGCGTGTCGACTACGCGTCGCACT 

CGGCGCACGTCGAGGCGATCGAGCGCGAGCTCCTCGCGGATCTCGCGGGG 

ATCGAACCGCGCGCGGGCGCTGTGCCGCTTTACTCCGCGGTGACGGGCGC 

GAAGCTCGACGGGAACCGCCTCGACCCCGCGCATTGGTTCCGGAACCTGC 

GCTCGACAAAAAACTTTGAGGACGCCACGCGCGCGCTCCACGACGACGGC 

CGCCGGGTATCCTCATNATCNNGGGCGTNCAGAGGAGTCGGTATTNCCCC 

CCCCCGCCTTNCCCG , 

or their complementary scrands, 
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(b) DNA-sequences which hybridise under stringent condicions 
CO regions of DNA-sequences according to (a) encoding proteins 
or to fragments of said DNA-sequences, 

(c) DNA-sequences which hybridise to the DNA-sequences accord- 
ing CO (a) and (b) because of a degeneration of the genecic 
code, 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isof unctional expression products. 

15. Peptide encoded by a DNA sequence according to claim 14 
selected from the group consisting of 

Seq ID No 121 

>Contigl7_001 828 amino acids MW=86259 D pl=5.60 numambig=>l 

MTVMATPGIFGEFSAQHAGAPDGRCKSFSSHANGAGWSEGAGIVLLERLSDARKNGHPVL 

AIVRGSAVNQDGKSQGLTAPNGPAQERVIRDALASAGLAPADVDAVEAHGTGTTLGDPIE 

ARALFATYGAAHAKERPLWLGSLKSNLGHTQAAAGIGGVIKNTVLALQSGVLPRTLHAAS 

SQHIDWSSGAI RLLTEEVPWKPGGSPRRAAVS S FGI SGTNAHVVLEEAPRAAAGDDQAGE 

PARAPFAPPVPVLLSAKSDAALRAQAGKLHAHLAEHEDVALVDVAASLATTRSHFERRAA 

WADDRAALLSAIJUOAEGRAGAGTVAGEALPPGKVAFVFPGQGSQWPSriARALLASSPA 

FRAEIEACERAIAPHVDWSLLAyLGGDEAHAAP^^ERVDWQPVLFAVMIAIJ^A 

VTPDAWGHSQGEIAAAYVAGALSLEDAARWALRSRAITKLAGRGAMSAVELTTAELEA 

RLAPLGERLAIAAINSPHAALVSGDPGAIDELVAELSGAQLFARKVRVEYASHSAHVEAI 

ERTMLEALDGIAPRPATVPLYSAVTGSRLDGEAI.GAAHWYRNLRHTVRFEHATRALLDDG 

HRFFVEVSPHPVLTVALDETLSSIDGDATWSSLRRDEGDLERFLLSAGELHARGHAIDW 

GAF FAPLGARR VTL PTYAFQRERFWLDAGDAS DDEDAAPGASAEETAFWRAI ERGD VAAL 
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SDALHVADSGRRSALESLMPALSAWRRSRREQSTVGAWRYRVEWRPVSAASRGDVAGTWL 
WAPAGVASDLAS ALAAALTERGADWPLALEASD I TDTGCRDRMSNVX 

Seq ID N'o 122 

>Contigl8_002 502 amino acids MW=53019 D pl=6.83 numambig=l 

FRXPDVHDRLANLVARRDYFYQLALRAAGTYVRGLVRAPHDGARPPAFAPRGAALVTGGT 

GALGAHVARWFARIGAEHIVLASRRGAAAPGAAALAEELSVLGARVTLVACDVPDREAVA 

GLVRNVTCAGGATVRAVFHAGGAMHEAPVAAMRVEEIJUDAIAVKARGAQHLQDVFAQRPl^ 

AFVLFSSHTGWGGGRQGAYAAANAFLDALAEARRADGLAATSIAWGAWAGGGMLATDAE 

RRLKHRGVAPMDPEIAVAAIJUiALDKAETCLAVADVDWARFAPSFASARPRPLLDELAEA 

RSALDALREPPDDAJITAAGPEPASTIRTTLAALPEGERHRHLLALVRTETAAVLGHADAS 

RVEPNRGFFDLGUIfSLMSVELRRRVQRATGIKLPATIJ^FDHPTPSALASK^nJU^IVLHDA 

TPRASPAAELERLEGMLSAIYADEALRDDLTARLRAFLDKRAVRTERPDDAAFAEKLGSA 

SADELIRLIDQKLGDRIDVDRY* 

Seq ID No 123 

>Concigl8_010 840 amino acids MW=88062 D pl=5.74 numambig=6 

MSTVTNDTLTEYLRRLTQELHRSETRLRATEERRHEPIAIVGLGLPFRGGIHDRDTLWTF 

LESGRDAI AP I LASRWNADATYDLDPDAVGKS YVRDAAMLDRVDLFDADFFGI S PREAKY 

VDPQHRLLLETSWQALEDAGIVPASLRDSKTGVFVGTGASDYAFLQSDRDASEAYAFMGM 

I S S F AAGRLAFTLGLQGPALS I DTAC S S S LVALHLACQ S LRQGECDLALVAGVQ VMS S P E 

VFVLLSRTRALASDGRSKTFSANADGYGRGEGWVLAVERLRDARAKGRPILAVXRGSAV 

NHDGTSSGITVPNGPAQQKVLRAALDDARLVPADVDWECHGTGTSIGDPIEVNALAAVY 

GEGRPKDRPLFLGALKTNIGHLEFASGIAGVAKMVASMRHATLPATLHTSPLNPLVDWDA 

LPVRWDAARPWTRRDDGAPRRAGVTAIVEEAPAEPEPTTPDAAPALPAVPVLLSGKTDE 

ALRAQAARLHAHLAGRPDARLVDIAA3LATTRTHFDRRAAVVAADRDELLGALDALARGE 

AGPGSWASAIPAGRWFVFPGQGSQWGMARALLASSWFRDEIAACERALAPHVAWSL 

GAVLRGDGDEATLLGRVD WQPVLFA^/'MVALAALWRS IGVTPDA WGHSQGE I AAA YVAG 

ALSLEDAAKWALRARALTKIAGRGAMAAVELGARDTEARLAPFGDAIAIAAINSPRATL 

VAGDTDAI DALVRDLEAAQI FARKVRVDYASHS AKVEAI ERELLADLAGI EPRAGAVPLY 
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16. Recombinanc expression vector which comprises a DNA- 
5 sequence according to any of claims 1 to 10, 12 and 14. 

17, Procaryocic or eucaryotic cell which has been transfected 
or transformed with a DNA-sequence according to any of claims 1 
to 10, 12 and 14 or with a recombinant expression vector ac- 

10 cording to claim 15. 



18. Cell according to claim 17. wherein the cell is derived 
25 from myxobacteria. 

15 19. Cell according to claim 17, wherein the cell is derived 
from a Sorangium strain. 

30 

20. Ceil according to claim 17, wherein the cell is- derived 
from Sorangium cellulosum. 

20 

35 

21. Cell according to claim 17, wherein the cell is derived 
from a Streptomyces strain. 

22. Cell according to claim 17, wherein the cell is derived 
25 from Escherichia coli. 



23. Process f or , an enzymatic biosynthesis, mutasynchesis or 
partial synthesis of polyketide or heteropolyketide compounds, 
wherein a cell according to any of claims 17 . to 22 is culti- 
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vated in a suitable culture medium and the polyketide or het- 
eropolyketide compound is isolated from the medium. 

24. Process according to claim 23, wherein the polyketide or 
heteropolyketide compound is an epothilone. 
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